PREFACE


This  volume  is  part  of  a  five-volume  set  that  summarizes  the  research  of  participants  in  the  1997  AFOSR 
TRANSIENT STUDIES OF THE EFFECTS OF FIRE SUPPRESSANTS
IN  A  WELL-STIRRED  COMBUSTOR 

J.M.  Calo 
Professor 

Division  of  Engineering 
BROWN  UNIVERSITY 
Providence,  RI 02912 

Abstract 


The  effects  of  three  different  fire  suppressants  on  the  transient  behavior  of  the  composition  of 
the  effluent  gas  from  a  combustor  were  investigated  in  an  experimental  program  conducted  at  the 
Well  Stirred  Reactor  (WSR)  facility  at  Wright-Patterson  Air  Force  Base  during  the  months  of  July 
and August 1996. A quadrupole mass spectrometer that was originally designed and constructed
to sample rocket exhaust gases was used to sample the effluent gas composition from the WSR.
These  experiments  employed  premixed  propane/air  and  methane/air  gas  mixtures  at  near- 
atmospheric  pressure  in  the  WSR.  Transient  measurements  were  performed  on  the  effects  of 
injecting  the  fire  suppressants  Halon  1301  (CF3Br),  pentafluoroethane  (C2HF5,  HFC-125),  and 
trifluoroiodomethane  (CF3I)  into  air/fuel  mixtures  at  various  concentration  levels  using  a  fast, 
pulsed  valve  system.  Selected  results  from  the  experimental  program  are  presented  and  discussed. 
It  was  concluded  that  the  three  fire  suppressants  exhibit  some  similarities  in  their  behavior  in 
combustion environments, as well as some distinct differences.

Results  are  presented  from  a  numerical  model  of  the  WSR  combustor  system.  This  model 
was  developed  to  interpret  the  resultant  transient  responses  of  gas  phase  species  measured  mass 
spectrometrically.  The  model  includes  the  effects  of  the  residence  time  distribution  of  the  jet  ring 
distributor,  as  determined  from  a  fluid  mechanical  model  developed  for  this  purpose,  and  the 
detailed  kinetics  in  the  well-mixed  combustor  and  the  sampling  line  to  the  mass  spectrometer.  The 
latter two components of the model are based on the CHEMKIN II package of codes, especially
the  perfectly-stirred  reactor  (PSR)  and  SENKIN  modules.  Work  has  also  been  done  on 
transforming  the  mole  fractions  reported  by  the  model  into  corresponding  mass  spectrometric 
signals.  Direct  comparison  of  model  results  with  the  combustion  data  has  been  initiated. 

The  transient  method,  coupled  with  model  analysis,  as  developed  here,  can  be  quite  useful 
for  the  identification  of  key  reactions  in  mechanisms  of  the  behavior  of  fire  suppressants  in 
combustion  systems.  This  approach  can  also  be  extended  to  determine  kinetic  rates  for  certain 
reactions.  This  effort  is  expected  to  improve  the  understanding  of  fire  suppression  mechanisms. 

The unique capabilities of the REMS/WSR apparatus could be used to identify and measure
in  a  definitive,  quantitative  manner  the  efficacy  of  candidate  fire  suppressants  using  only  minimal 
pulsed  amounts,  as  well  as  aid  in  the  assessment  of  the  important  environmental  effects  of  using 
fire  suppressants  in  combustion  systems  relevant  to  Air  Force  operations. 
TRANSIENT STUDIES OF THE EFFECTS OF FIRE SUPPRESSANTS
IN  A  WELL-STIRRED  COMBUSTOR 


J.M.  Calo 


Background 

The ban on the use of Halon 1301 in fire suppression systems for Air Force gas turbine
engines requires the identification of effective replacements. However, thus far the search for
more environmentally benign replacement agents has identified only less effective substitutes. The
interaction of chemical agents, such as Halon 1301, with combustion environments is a complex
thermo- and physicochemical process which is still not completely understood [1,2,3]. In fact, it
has  been  shown  that  even  Halon  1301  can  actually  enhance  combustion  under  certain  conditions 
[3].  Consequently,  the  search  for  safe,  effective  substitute  fire  suppressant  agents  would  benefit 
considerably  from  advancing  the  fundamental  understanding  of  the  interaction  of  fire  suppressants 
with  the  complex  combustion  gas  milieux.  This  is  the  primary  objective  of  the  work  reported  here. 

The  effects  of  three  different  fire  suppressant  agents  on  the  transient  behavior  and 
composition  of  the  effluent  gas  from  a  combustor  were  investigated  in  an  experimental  program 
conducted  at  the  Well  Stirred  Reactor  (WSR)  facility  at  Wright-Patterson  Air  Force  Base  during  the 
months of July and August 1996. A quadrupole mass spectrometer that was originally designed
and constructed to sample rocket exhaust gases was used to sample the effluent gas composition
from  the  WSR.  These  experiments  employed  premixed  propane/air  and  methane/air  gas  mixtures  at 
near-atmospheric  pressure  in  the  WSR.  The  parameter  values  for  the  three  principal  sets  of 
operating  conditions  of  the  WSR  system  for  methane  combustion  are  presented  below  in  Table  I. 

Transient  measurements  were  performed  on  the  effects  of  injecting  the  fire  suppressants 
Halon  1301  (CF3Br),  pentafluoroethane  (C2HF5,  HFC-125),  and  trifluoroiodomethane  (CF3I) 
into  air/fuel  mixtures  at  various  concentration  levels  using  a  fast,  pulsed  valve  system. 

In order to analyze the resultant transient data, a numerical model of the WSR and mass
spectrometer sampling line has been formulated.

Table I. Nominal (average) WSR methane combustion run conditions.

Air     Methane   Ring T   Ring P         WSR T   WSR P   WSR τ   Equivalence
(slm)   (slm)     (K)      (kPa)          (K)     (kPa)   (ms)    ratio

148     5.5       600      137.2 (128)    1500    100.5   17.8    0.56
200     14.8      580      [illegible]    1790    100.5   11.1    0.7
390     28.5      573      223.8 (230)    1770    101.3   5.7     0.7

Experimental 

The  Well  Stirred  Reactor  (WSR)  facility,  located  in  the  Fuels  and  Lubricants  Group  of 
Wright  Laboratories  at  Wright-Patterson  Air  Force  Base,  Ohio,  has  been  well  described  elsewhere 
[4,5].  The  reactor  is  cast  from  alumina  insulating  cement  in  two  halves  which  together  form  a 
toroidal shape, 250 ml in volume. Gas is admitted into the WSR via 32 jets of 1.0 mm I.D., made
from alumina tubes, located 20° off radius and held in place in a jet ring assembly which serves
as the gas distributor for the reactor. The jets intensely mix the reactor contents and are responsible
for its well-mixed behavior. The gas outlet is located in the center of the top half of the reactor, 90°
to the feed gas flow, and exhausts into a large diameter plug flow reactor (PFR) section. Nominal
reactor operating parameters include residence times on the order of 6-18 ms, and operation at up to 5
atm pressure and 2300 K, the latter made possible by the use of zirconia-coated refractory materials.
The three principal sets of operating conditions used in the current work are listed in Table I.
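
As a rough cross-check of the residence times quoted for the 250 ml reactor, the nominal residence time can be estimated as the reactor volume divided by the volumetric flow at reactor conditions. The sketch below is illustrative only: the function name, the standard state assumed for "slm" (273.15 K, 101.325 kPa), the ideal-gas scaling, and the neglect of any change in the number of moles on combustion are assumptions, not details taken from the report.

```python
# Sketch: nominal WSR residence time from standard-litre feed rates.
# Assumptions (not from the report): ideal gas, a standard state of
# 273.15 K and 101.325 kPa for "slm", and no net change in moles.

T_STD_K = 273.15      # assumed standard temperature for slm
P_STD_KPA = 101.325   # assumed standard pressure for slm


def residence_time_ms(total_feed_slm, reactor_t_k, reactor_p_kpa, volume_l=0.250):
    """Reactor volume divided by volumetric flow at reactor conditions, in ms."""
    # Scale the standard volumetric flow to reactor temperature and pressure.
    actual_l_per_min = (
        total_feed_slm * (reactor_t_k / T_STD_K) * (P_STD_KPA / reactor_p_kpa)
    )
    actual_l_per_s = actual_l_per_min / 60.0
    return 1000.0 * volume_l / actual_l_per_s


if __name__ == "__main__":
    # 200 slm air + 14.8 slm methane at 1790 K and 100.5 kPa (Table I values).
    tau = residence_time_ms(200.0 + 14.8, 1790.0, 100.5)
    print(f"estimated residence time: {tau:.1f} ms")
```

Under these assumptions the estimate for the 200 slm air case comes out at about 10.6 ms, the same order as the 11.1 ms listed in Table I; the difference is sensitive to the standard state chosen for the flow meters.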

The  three  different  fire  suppressants  were  introduced  into  the  WSR  via  a  fast,  pulsed 
solenoid  valve  manufactured  by  the  General  Valve  Corporation.  These  valves  were  originally 
developed for molecular beam laser spectroscopy experiments. The valves are set for 2 ms at rated
voltage, but can achieve even shorter pulse widths by overdriving with a 300 V pulse for 165 µs.
The  valve  on  and  off  times  were  controlled  with  the  mass  spectrometer  microcomputer.  The 
minimum “open” time used in the current experiments was 5 ms, which was the same as the
resolution of the mass spectrometer data collection system.

The  effluent  gas  from  the  reactor  was  sampled  into  the  mass  spectrometer  via  an  air-cooled 
quartz  probe,  which  was  connected  to  the  bottom  of  the  WSR  through  a  drilled-out  stainless  steel 
Swagelok™ fitting with a graphite ferrule. The opening of the probe was positioned 1-2 mm away
from  the  toroidal  combustion  volume.  The  flow  rate  through  the  probe  was  controlled  by  a  pump 
which  was  throttled  to  approximately  8-10  slm.  The  probe  was  about  35.5  cm  in  length,  with  an 
I.D.  of  0.28  cm.  Under  typical  sampling  conditions,  the  probe  was  operated  at  temperatures 
greater  than  100°C  in  order  to  prevent  water  vapor  condensation. 

The  quadrupole  mass  spectrometer  was  designed  and  constructed  by  the  GPID  Branch  of  the 
Ionospheric  Effects  Division,  Geophysics  Directorate  of  the  Phillips  Laboratory.  The  gas  from  the 
sampling  probe  flowed  through  a  stainless  steel  cap  which  was  maintained  at  temperatures  greater 
than 100°C. Part of the gas was sampled through a 100 µm orifice into a two-stage, differential
vacuum  system,  pumped  by  three  turbomolecular  pumps.  The  volume  between  the  inlet  orifice  and 
the  skimmer  element  was  maintained  at  about  10'^  torr.  The  flow  sampled  through  the  skimmer 
was  admitted  to  the  mass  spectrometer  stage  which  typically  operated  at  about  10'^  - 10"^  torr. 

The  mass  spectrometer  was  controlled  via  a  digital  interface  with  a  laptop  computer.  Other 
than  for  diagnostic  scans  over  a  selected  mass  range,  the  mass  spectrometer  was  run  in  a  “mass 
programming”  mode.  That  is,  while  the  fast  solenoid  valve  was  pulsing  a  particular  fire 
suppressant  into  the  WSR,  certain  selected  mass  peaks  were  monitored  for  a  preset  sampling  time 
in  a  sequential  fashion.  The  signal  intensities  of  these  selected  mass  peaks  were  recorded  as  a 
function  of  time.  The  final  transient  response  is  actually  the  averaged  sum  of  the  responses  for  a 
large  number  of  fire  suppressant  pulses. 
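
The pulse-averaging step described above can be sketched as a sample-by-sample mean over repeated pulse responses. This is only an illustration of the idea: the function name and the list-of-lists data layout are hypothetical, and the actual data acquisition software is not described in the report.

```python
# Sketch: ensemble-average the responses recorded for repeated pulses.
# The data layout (one list of samples per pulse) is an assumption.


def average_pulse_responses(responses):
    """Average equal-length signal traces sample by sample."""
    if not responses:
        raise ValueError("need at least one pulse response")
    n_samples = len(responses[0])
    if any(len(trace) != n_samples for trace in responses):
        raise ValueError("all pulse responses must have the same length")
    n_pulses = len(responses)
    return [
        sum(trace[i] for trace in responses) / n_pulses
        for i in range(n_samples)
    ]


if __name__ == "__main__":
    # Two short, made-up traces for the same mass peak.
    print(average_pulse_responses([[0.0, 2.0, 4.0], [2.0, 4.0, 6.0]]))
```

Averaging many pulses in this way reduces uncorrelated noise while preserving the shape of the repeatable transient.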

Krypton  gas  was  used  as  an  inert  tracer  to  determine  the  time  response  of  the  reactor  system. 
Figure  1  presents  some  typical  data  obtained  for  the  ignited  reactor  operating  with  methane  as  the 
fuel. As shown, the krypton pulses reflect the varying duration of the valve “open” time. Also, as
indicated in the figure caption, the apparent characteristic time constant for the pulse rise is on the
order of 45 ms and that for the pulse decay is 22 ms. Both are significantly longer than the
WSR residence time, which was about 12 ms. Analysis of the transient response of the entire
reactor system indicates that the longer time constants observed are caused by a combination of
dispersion of the tracer upstream of the jet ring, where the pulsed gas mixes with the feed flow, and
dispersion caused by the time delay of the gas feed to all the jets in the jet ring distributor,
as analyzed below.

Figure 1. Krypton signal response data for variable duration pulses and the corresponding exponential fits to the
data for the ignited WSR operating with 200 slm air and 14.7 slm methane (τ ≈ 12 ms). The
characteristic time constant for the pulse rise is ~45 ms, and 22 ms for the pulse decay, both of which differ
significantly from τ.
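
One simple way to extract a characteristic decay time from a tracer response such as the krypton data is a least-squares line through the logarithm of the signal. The sketch below is an assumption about how such a fit could be done; the report does not state which fitting procedure was used for the exponential fits in Figure 1, and the function name is hypothetical.

```python
# Sketch: estimate an exponential decay time constant by a log-linear
# least-squares fit, ln(signal) = ln(A) - t / tau.
# This is one possible method, not necessarily the one used in the report.
import math


def fit_decay_time_constant(times_ms, signals):
    """Return tau (ms) from a straight-line fit to ln(signal) versus time."""
    points = [(t, math.log(s)) for t, s in zip(times_ms, signals) if s > 0]
    if len(points) < 2:
        raise ValueError("need at least two positive signal samples")
    n = len(points)
    mean_t = sum(t for t, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    s_tt = sum((t - mean_t) ** 2 for t, _ in points)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in points)
    slope = s_ty / s_tt           # equals -1 / tau for a pure exponential
    return -1.0 / slope


if __name__ == "__main__":
    # Synthetic decay with a 22 ms time constant, sampled every 5 ms.
    times = [5.0 * i for i in range(21)]
    signal = [3.0 * math.exp(-t / 22.0) for t in times]
    print(f"fitted tau = {fit_decay_time_constant(times, signal):.2f} ms")
```

For real data a baseline would need to be subtracted first, since a constant offset biases a log-linear fit toward longer time constants.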


Experimental  Results  and  Discussion 


Halon  1301 

The  principal  ion  fragments  which  have  been  reported  for  electron  impact  ionization  of 
CF3Br  are:  69  -  CF3+  (100%);  148  and  150  -  CF3Br+  (11.2%);  129  and  131  -  CF2Br+  (8.9%); 
50 - CF2+ (5.8%); 79 and 81 - Br+ (5.8%); 31 - CF+ (4.3%); and 12 - C+ (\.S%) [6]. Since this
pattern  is  never  observed  in  the  current  work,  it  is  concluded  that,  in  general,  little  Halon  1301 
survives  the  WSR  under  the  current  experimental  conditions;  i.e.,  it  is  primarily  converted  to  other 
species.  This  is  entirely  consistent  with  the  anticipated  behavior  of  CF3Br  in  combustion 
environments. Since the C-F bond energy is much greater than that of C-Br (~500 kJ/mol vs.
295.4 kJ/mol, respectively) [7], Halon 1301 readily decomposes via:

CF3Br → CF3 + Br [R.1]

Indeed, the fire retardant properties of Halon 1301 have been attributed to the reaction of the Br
and  CF3  radicals  liberated  in  this  manner  with  other  species  in  the  flame  such  as  to  remove  chain 
carriers  from  the  reaction  mechanism  [1,3]. 

Recent work suggests that at short residence times (~0.1 s) under steady-state conditions
only a small fraction (~8%) of Halon 1301 decomposes via [R.1], with the majority (92%)
reacting with methyl radicals to form CH3Br via [3]:

CF3Br + CH3 → CH3Br + CF3 [R.2]

The  product  of  this  reaction,  methyl  bromide,  is  an  important  species  in  the  decomposition 
mechanism  of  Halon  1301  in  combustion  environments.  Indeed,  this  species  has  been  detected  by 
GC-MS  [3].  However,  the  parent  peaks  for  this  species  (masses  94  and  96)  were  generally  not 
observed  in  the  current  work.  This  is  undoubtedly  because  of  the  weak  C-Br  bond  (292.9  kJ/mol) 
[7]  and  the  fact  that  methyl  bromide  is  rapidly  consumed  to  produce  HBr.  Battin-Leclerc  et  al.  [3] 
computed  that  at  a  residence  time  of  0.5  s  the  consumption  of  CH3Br  is  primarily  via: 

72% to: CH3Br + H → HBr + CH3 [R.3]

24% to: CH3Br → Br + CH3 [R.4]

In addition to [R.3], HBr is generally produced by abstraction of H by Br radicals from
fuel-related hydrocarbons, as well as by direct abstraction of Br by H from CF3Br:

CF3Br + H → CF3 + HBr [R.5]

Due  to  all  the  various  sources,  HBr  is  the  major  stable  species  produced  by  the  decomposition  of 
CF3Br  in  combustion  environments.  Indeed,  this  is  the  prevalent  species  that  was  observed  in  the 
spectra  whenever  Halon  1301  was  introduced  in  the  WSR. 

It was previously established that the major stable products from the addition of Halon 1301
to the combustor were HF and HBr [8]. Figure 2 presents the transient response of HF+ (m/e =
20) and masses m/e = 79 (79Br+), 80 (H79Br+), 81 (81Br+), and 82 (H81Br+). As shown, the
79/81 ratio was typically close to unity, consistent with the natural bromine isotopic abundance of
0.507/0.493 [7]. However, the 82 signal was significantly less than that for 80, so
there are probably additional contributions to 80 other than from H79Br+. Electron impact fragmentation
of HBr is known to produce Br+ at a little less than half the intensity of the HBr+ signal.
Battin-Leclerc et al. [3] noted that the Br radical concentration should closely follow that of HBr in
the reactor, except at very long residence times, although it is less by almost two orders of
magnitude. With the sampling configuration that was used in the current experiments, Br radicals
probably do not survive the gas sampling system as such, and, therefore, there must also be some
contribution to the Br+ signal as ion fragments from more stable brominated species in addition to
HBr, such as CH3Br and Br2. The parent ion peaks for Br2+, 158 and 162, were not observed.
However, due to the weak Br-Br bond (192.8 kJ/mol) [7], it is expected that Br+ would be the
predominant ion produced from Br2.
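
The isotope argument above can be made concrete with a short calculation. Using only the 0.507/0.493 abundances quoted in the text, the sketch below computes the 80/82 ratio expected if both peaks arose solely from HBr+, and the statistical isotopologue pattern a Br2+ parent ion would show. The function names are hypothetical, and the calculation assumes purely statistical isotope combination.

```python
# Sketch: expected bromine isotope patterns from the abundances quoted in
# the text (0.507 for 79Br, 0.493 for 81Br). Statistical mixing is assumed.
from itertools import product

BR_ABUNDANCE = {79: 0.507, 81: 0.493}


def hbr_80_to_82_ratio():
    """Ratio of H79Br+ (m/e 80) to H81Br+ (m/e 82) if HBr is the only source."""
    return BR_ABUNDANCE[79] / BR_ABUNDANCE[81]


def br2_pattern():
    """Relative abundance of each Br2 nominal mass from pairing two Br atoms."""
    pattern = {}
    for (m1, a1), (m2, a2) in product(BR_ABUNDANCE.items(), repeat=2):
        pattern[m1 + m2] = pattern.get(m1 + m2, 0.0) + a1 * a2
    return pattern


if __name__ == "__main__":
    print(f"expected 80/82 ratio for HBr alone: {hbr_80_to_82_ratio():.3f}")
    for mass, fraction in sorted(br2_pattern().items()):
        print(f"Br2+ at m/e {mass}: {fraction:.3f}")
```

A ratio near unity is what HBr alone would give, so an 82 signal well below the 80 signal points to an additional contribution at mass 80. The same statistics imply that a Br2+ parent ion would also appear at m/e 160, with roughly twice the intensity of the peaks at 158 and 162.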

As  shown  in  Figure  2,  although  the  rise  of  mass  20  is  quite  similar  to  that  of  the  inert  krypton 
tracer,  its  decay  is  much  longer.  This  is  interpreted  as  being  indicative  of  the  flame  chemistry 
which  produces  HF  as  a  terminal  stable  product  via  a  number  of  different  reactions.  The 
HBr+/Br+ signals also seem to follow the HF+ rise and decay, consistent with the
hypothesis that these mass peaks too are indicative of stable product species.

Figure 2. HBr+/Br+ and HF+ signal responses to Halon 1301 pulses of 50 ms duration into the ignited WSR
operating with 200 slm air and 5.9 slm propane (τ = 12 ms). The Kr+ response is plotted as a reference.

The other series of major ion peaks associated with the CF3Br pulses are masses 47, 66, and
85, as discussed above, which are attributed to the ions CFO+, CF2O+, and CF3O+, respectively.
As shown in Figure 3, unlike the behavior of mass 20 (HF+), which exhibits a longer
characteristic decay time, the responses of masses 47 and 66 decay significantly faster, following the
inert krypton tracer response quite closely. This is the type of response that would be expected
of intermediate product species in the flame chemistry that react further to form
more stable terminal product species. As shown in Figure 4, the responses of masses 47 and 66
are approximately proportional to the concentration level of the Halon 1301 pulses. Their
responses also exhibit a distinct “double-peaked” behavior under these conditions.

As  shown  in  Figure  5,  the  behavior  of  mass  85  is  decidedly  different  than  that  of  masses  47 
and  66  (as  shown  in  Figure  4).  Although  it  also  varies  monotonically  with  the  concentration  level 
of  the  Halon  1301  pulses,  it  does  not  exhibit  the  same  “double-peaked”  behavior,  and  definitely 
lags  the  responses  for  the  other  two  masses. 

Figure 6 presents an example of the behavior of these three mass peaks with WSR residence
time. It is quite evident that both the relative and absolute magnitudes of the three peaks are
sensitive to the WSR residence time on the time-scales involved. For example, while mass 85 is
the smallest signal by far for a residence time of 19 ms, it becomes the largest signal for residence
times of 12 ms and 6 ms. Thus mass 85 reflects characteristics of a species which forms early in
the interactions of Halon 1301 with the combustion environment, and is then consumed in
subsequent reactions.

Reaction mechanisms that have been developed to describe the behavior of CF3Br in
combustion environments all include FCO (mass 47) and CF2O (mass 66), carbonyl fluoride
[1,2,3]. The latter is a stable, colorless, pungent, toxic gas [9] which can certainly survive the gas
sampling system. CF2O is produced in the WSR primarily by the reactions:

CF3 + O → CF2O + F [R.6]

CF3 + OH → CF2O + HF [R.7]

Since these two reactions are both much faster than the bond scission reaction for the
decomposition of CF3Br, [R.1], it is logical that the mass 66 (CF2O+) signal would parallel the
behavior of the Br+/HBr+ mass numbers, at least early in the reaction mechanism, and that it
would increase with increasing levels of CF3Br (i.e., CF3). This is consistent with the data
presented in Figures 2 and 3. FCO, on the other hand, is a radical species which cannot survive the
gas sampling tube to the inlet of the mass spectrometer. However, FCO+ can be produced as an
ion fragment of stable species such as CF2O, or CF3OF (see below). In Figure 4, it is noted that
the mass 47 signal is consistently less than that of mass 66, which is consistent with mass 47 being a
fragment ion. It is also noted that the “double-peaked” responses of masses 47 and 66 become
notable only when the mass 85 signal is high (e.g., at lower residence times), and that the mass 85
response coincides quite well with the second peak of the “double-peaked” responses. This is
consistent with some contribution to masses 47 and 66 due to fragmentation of the parent species
of mass 85 caused by electron impact in the ion source of the mass spectrometer; i.e.,

CF3OH (or CF3OF) → CF3O+, CF2O+, CFO+ (upon electron impact) [R.8]

as well as:

CF2O → CF2O+, CFO+ (upon electron impact) [R.9]

Mass 85 must arise from a parent species of molecular weight 85 or higher. It is noted
that the species CF3OF has been reported in the literature [10], although it is not currently included
in the Halon 1301 combustion codes [1,2,3]. However, from the various Halon-associated
radicals that are present in the combustion system, it is certainly plausible that CF3OF can be
produced from a number of radical termination reactions, such as:

CF3 + FO → CF3OF; ΔH° = -275 kJ/mol [R.10a]

CF2O + CF2O → CF3OF + CO; ΔH° = -377 kJ/mol [R.10b]

F2 + CF2O → CF3OF; ΔH° = -419 kJ/mol [R.10c]

which are all exothermic, as calculated from enthalpy of formation data [7]. Ionization of CF3OF
would almost certainly result primarily in the production of the fragment ion CF3O+, since the O-F
bond is the weakest in the molecule by far at 182 kJ/mol [7]. Moreover, the parent ion peak would
be mass 104. It is noted that a very small, but measurable amount of mass 104 was typically
detected whenever mass 85 was most intense.
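
The mass assignments used throughout this discussion follow directly from nominal (integer) isotope masses. The sketch below is a small bookkeeping aid for checking them; the function name and the simple formula parser are hypothetical, and only the most abundant isotope of each element is used, with the bromine isotope selectable.

```python
# Sketch: nominal masses of the species and ions discussed in the text.
# Nominal isotope masses are standard values; the parser handles only
# simple formulas such as "CF3OF" (no parentheses or charges).
import re

NOMINAL_MASS = {"H": 1, "C": 12, "O": 16, "F": 19, "Br": 79, "I": 127}


def nominal_mass(formula, bromine_isotope=79):
    """Sum nominal isotope masses for a simple molecular formula."""
    masses = dict(NOMINAL_MASS, Br=bromine_isotope)
    total = 0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += masses[element] * (int(count) if count else 1)
    return total


if __name__ == "__main__":
    for species in ("CFO", "CF2O", "CF3O", "CF3OF", "HF", "CF3Br", "CH3Br"):
        print(species, nominal_mass(species))
```

With these values CFO, CF2O and CF3O fall at 47, 66 and 85, and CF3OF at 104, which is the parent mass referred to above.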
Figure 3. Mass 47, 66 and 85 signal responses for 0.75% Halon 1301 pulses of 50 ms duration into the ignited
WSR operating with 150 slm air and 3.5 slm propane (τ = 19 ms). The Kr+ inert tracer response is
plotted as a reference.

Figure 4. Mass 47 and 66 signal responses as a function of Halon 1301 concentration level of 50 ms pulses into
the ignited WSR operating with 200 slm air and 5.9 slm propane (τ ≈ 12 ms).
CF3I

Due to its chemical similarity to Halon 1301, the behavior of CF3I in the combustor was
expected to be quite similar as well. The products of the interaction of CF3I with the combustion
environment were primarily HF and HI, the latter observed as mass peaks 127 (which includes
contributions from all iodine-containing species) and 128. In addition, molecular iodine, I2, was
evident as a purplish gas in the effluent from the combustor. Mass peaks 47, 66, and 85 are also
present, as shown in Figure 7, at intensities that are quite similar to those in Figure 6 (although
mass 85 is somewhat larger for CF3I, under similar conditions). The major difference that is
apparent in Figure 7 is that mass 85 has a longer characteristic decay time than masses 47 and
66, unlike for Halon 1301. Also, the “double-peaked” behavior of masses 47 and 66 is not
apparent, and the three signals seem to superimpose more than was noted for Halon 1301. These
are all indicators that there may be some differences in the combustion reaction mechanism for
these two species. It was also noted that for similar concentration levels, CF3I seemed to be
slightly more effective in bringing the reactor to “near-blowout” conditions.


Figure 7. Mass 47, 66 and 85 signal responses for 0.75% CF3I pulses of 50 ms duration into the ignited WSR
operating with 200 slm air and 14.8 slm methane (τ = 12 ms).

Figure 8. Mass 19, 20, 37, 47, 66 and 85 signal responses for 0.75% HFC-125 pulses of 50 ms duration into the
ignited WSR operating with 150 slm air and 5.25 slm methane (τ = 19 ms).


HFC-125 

HFC-125  (C2HF5)  exhibited  somewhat  different  behavior  than  the  other  two  fire 
suppressants  used  in  the  current  investigation.  Its  fire  suppressant  action  is  generally  believed  to  be 
more  physical  than  chemical.  This  was  substantiated  by  the  observation  that  it  took  about  an  order 
of  magnitude  more  HFC-125  than  the  other  two  agents  to  bring  the  reactor  to  “near-blowout” 
conditions. In addition, HFC-125 behaved as a fuel in that the reactor temperature pulsed by as
much as 50 K in synchronization with the HFC-125 pulses. Neither CF3Br nor CF3I exhibited this
behavior; i.e., the reactor temperature always remained constant during pulsing.

Figure  8  presents  the  behavior  of  masses  20,  47,  66,  and  85.  As  shown,  the  latter  three 
exhibit  behavior  similar  to  that  observed  for  CF3I.  In  this  figure  mass  85  is  the  smallest  of  the 
three,  but  this  is  primarily  due  to  the  longer  residence  time  of  the  WSR,  as  is  evident  in  the  data  in 
Figure  6.  In  addition,  the  decay  of  the  mass  85  response  appears  to  be  longer  than  that  for  mass  47 
and  66,  just  as  was  observed  in  Figure  7  for  CF3I. 

In Figure 8, the mass 20 (HF) response is plotted on a linear scale to accentuate the fact that
the pulse peak exhibits a minimum. This behavior is preceded by a large increase in the mass 19
(F+) signal, accompanied by a similar increase in the mass 37 (H2O·H3O+) water cluster signal,
which was used to monitor water indirectly because the very large mass 18 parent water peak
saturated the mass spectrometer. This appears to be related to the apparent behavior of HFC-125 as
a  fuel.  That  is,  initially  it  decomposes  and  combusts  to  form  water  and  various  fluorine-containing 
species,  and  then  it  begins  to  participate  in  the  reaction  chemistry  in  a  fashion  more  similar  to  that 
of  CF3Br  and  CF3I.  If  this  is  indeed  the  case,  transient  data  such  as  these  for  HFC-125  and  similar 
species,  may  eventually  provide  improved  insight  concerning  fire  suppression  mechanisms  in 
general. 


Model  Formulation 



Figure  9.  Schematic  of  the  overall  transient  model  for  pulse  injections  of  fire  suppressants. 


A  numerical  model  has  been  developed  in  order  to  analyze  the  transient  measurements.  The 
overall  model  consists  of  three  parts,  each  corresponding  to  one  of  the  three  physical  elements  of 
the  combustor  apparatus:  (1)  the  jet  ring  distributor;  (2)  the  well  mixed  combustor;  and  (3)  the 
mass  spectrometer  sampling  line.  The  basis  of  the  jet  ring  distributor  model  is  presented 
immediately below. The combustor and sampling line models are based on a package of
FORTRAN codes known as CHEMKIN II [11]. In particular, the PSR (perfectly stirred reactor)
module from this code [12] was used to model the behavior of the WSR, while the SENKIN
module [13] was used to model the constant-pressure reactions and radical wall-terminations
within the sampling tube. A conceptual schematic of the model is presented in Figure 9.


Figure 10. Schematic of the WSR jet ring gas distributor.


Jet  Ring  Gas  Distributor  Model  -  Residence  Time  Distribution  (RTD)  of  the  Jet  Ring 
In  order  to  properly  analyze  the  transient  data,  it  is  necessary  to  know  the  time  response  of 
the  entire  system  consisting  of  the  pulsed  valve/jet  ring/reactor  combination.  The  residence  time 
distribution  (RTD)  of  the  WSR  is  known  to  be  exponential,  characterized  by  a  time  constant  equal 
to  the  reactor  residence  time.  However,  the  RTD  of  the  jet  ring  gas  distributor  which  feeds  the 
reactor  was  not  known,  and  thus  had  to  be  determined  for  the  development  of  the  model. 
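
To illustrate why the jet ring contribution matters, the response of an ideal well-stirred reactor alone to a rectangular inlet pulse can be written in closed form from its exponential RTD. The sketch below evaluates that ideal response; the function name is hypothetical, and the inlet pulse is assumed to be perfectly rectangular with no upstream dispersion, which is exactly the idealization the measured krypton responses depart from.

```python
# Sketch: normalized outlet response of an ideal well-stirred reactor
# (exponential RTD with time constant tau) to a rectangular inlet pulse.
# Assumes a perfectly sharp pulse and no dispersion upstream of the reactor.
import math


def stirred_reactor_pulse_response(t_ms, pulse_width_ms, tau_ms):
    """Outlet tracer level (inlet pulse height = 1) at time t_ms."""
    if t_ms < 0:
        return 0.0
    if t_ms <= pulse_width_ms:
        # Rise while the pulse is on.
        return 1.0 - math.exp(-t_ms / tau_ms)
    # Exponential washout after the pulse ends.
    peak = 1.0 - math.exp(-pulse_width_ms / tau_ms)
    return peak * math.exp(-(t_ms - pulse_width_ms) / tau_ms)


if __name__ == "__main__":
    for t in (0, 10, 25, 50, 60, 75, 100):
        level = stirred_reactor_pulse_response(t, pulse_width_ms=50.0, tau_ms=12.0)
        print(f"t = {t:3d} ms  response = {level:.3f}")
```

In this idealization both the rise and the decay are governed by the single reactor time constant, so rise and decay constants that are longer than the residence time, and different from each other, indicate additional dispersion elsewhere in the system.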

A schematic of the jet ring gas distributor is presented in Figure 10. As shown, the WSR jet
ring is annular with an O.D. of 11.8 cm and I.D. of 10.8 cm. The annular ring has a rectangular
cross  section,  1  cm  wide  x  0.635  cm  in  height.  The  jet  ring  holds  the  32  jet  tubes  of  the  WSR  in 
place  at  its  I.D.,  and  serves  as  a  manifold/distributor  for  the  air/fuel  mixture  through  the  jet  tubes 
into  the  reactor  volume.  The  following  analysis  was  conducted  to  characterize  the  behavior  of  the 
jet  ring  distributor. 

Velocity  and  Pressure  Distributions  in  the  Jet  Ring.  The  jet  ring  serves  as  a 
manifold  distributor  for  the  WSR.  Its  behavior  can  be  described  using  the  energy  balance  and  the 
continuity  equation.  Using  cylindrical  coordinates,  it  is  assumed  that  the  gas  velocity,  density,  and 
pressure  in  the  jet  ring  are  uniform  in  the  z  direction. 

The  integrated  form  of  the  steady-state  energy  balance  on  a  control  volume  (c.v.)  is  given  by: 

$$\iint_{c.s.} \rho\,\mathbf{v}\left(\frac{v^2}{2} + \frac{w^2}{2} + \frac{P}{\rho}\right)\cdot d\mathbf{S} = -\Phi \qquad [1]$$

where the bold symbols represent the velocity vector, $\mathbf{v}$, and the normal surface area vector, $d\mathbf{S}$,
for points on the control surface (c.s.) through which fluid flows into and out of the control
volume, and $\Phi$ is the energy loss due to friction within the control volume. In this expression, the
scalar components of the velocity vector are denoted as $v = v_r$ and $w = v_\theta$. Within the jet
ring distributor, the gas velocity is well below sonic, such that the fluid may be treated as
incompressible and the density will be approximately constant. With these assumptions,
application of Eq. [1] to a control volume between jets in the jet ring of rectangular cross section
of height $h$ ($h$ = 0.25 in. = 0.635 cm), as shown in Figure 10, yields:


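$$-\rho w h (R_2 - R_1)\left(\frac{w^2}{2} + \frac{P}{\rho}\right)\Big|_{\theta} + \rho w h (R_2 - R_1)\left(\frac{w^2}{2} + \frac{P}{\rho}\right)\Big|_{\theta + d\theta} = -\left[\frac{\rho f\,(w^2/2)}{4}\right] w\, dA, \qquad [2]$$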

where  R]  and  R2  are  the  inner  and  outer  diameters  of  the  jet  ring,  respectively,  f  is  the  friction 
factor,  and  dA  is  the  total  differential  wall  surface  area  of  the  rectangular  duct.  The  friction  factor, 
f,  was  determined  from  the  empirical  expression  [18]: 

$$f = 0.25/[\log(\varepsilon/(3.7 \times 4R_h) + 5.74/Re^{0.9})]^2 \qquad [3]$$

where $\varepsilon$ is the roughness (assumed as 6 x 10"^ in. for stainless steel), and $R_h$ is the hydraulic
radius of the rectangular duct:

$$R_h = (R_2 - R_1)\,h/[2\,(h + R_2 - R_1)] \qquad [4]$$

and  Re  is  the  corresponding  Reynolds  number: 

$$Re = 4 R_h w/\nu, \qquad [5]$$

where $\nu$ is the kinematic viscosity of the gas. For air [19],

$$\nu = [1.458 \times 10^{-6}\, T^{1.5}/(T + 110.4)]/[P/(287\,T)], \quad \mathrm{m^2/s}$$

where  T  is  the  temperature  in  K,  and  P  is  the  pressure  in  Pa. 
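The friction-factor chain of Eqs. [3]-[5] can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, the logarithm in Eq. [3] is assumed to be base 10, and the Reynolds number is assumed to be formed with the kinematic viscosity, Re = 4 R_h w/ν. All quantities are in SI units.

```python
import math

def hydraulic_radius(r1, r2, h):
    """Eq. [4]: flow area divided by wetted perimeter of the rectangular duct."""
    return (r2 - r1) * h / (2.0 * (h + r2 - r1))

def kinematic_viscosity_air(T, P):
    """Sutherland viscosity of air divided by ideal-gas density (T in K, P in Pa)."""
    mu = 1.458e-6 * T**1.5 / (T + 110.4)   # dynamic viscosity, kg/(m s)
    rho = P / (287.0 * T)                  # density, kg/m^3
    return mu / rho                        # m^2/s

def reynolds(r_h, w, nu):
    """Eq. [5], assuming the kinematic-viscosity form Re = 4 R_h w / nu."""
    return 4.0 * r_h * w / nu

def friction_factor(eps, r_h, re):
    """Eq. [3], assuming a base-10 logarithm; eps and r_h in the same length unit."""
    return 0.25 / math.log10(eps / (3.7 * 4.0 * r_h) + 5.74 / re**0.9) ** 2
```

For example, `friction_factor(eps, r_h, reynolds(r_h, w, kinematic_viscosity_air(T, P)))` gives the friction factor for a chosen ring velocity w and gas state (T, P).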

For the jet ring duct of rectangular cross section, the total wall surface area for a differential
volume element of angular arc $d\theta$ is:

$$dA = [(R_2 + R_1)(R_2 - R_1) + 2R_2 h]\, d\theta \qquad [6]$$

Taking the limit of Eq. [2] as $d\theta \to 0$:

$$w\, dw/d\theta + (1/\rho)\, dP/d\theta = -(f w^2/8)\,\phi, \qquad [7]$$

where the geometric factor,

$$\phi = (R_2 + R_1)/h + 2R_2/(R_2 - R_1) \qquad [8]$$

The integrated form of the steady-state continuity equation for the control volume is:

$$\rho \iint_{c.s.} \mathbf{v} \cdot d\mathbf{S} = 0 \qquad [9]$$

For the same differential control volume of rectangular cross section, this expression becomes:

$$d[\rho w h (R_2 - R_1)]/d\theta = 0, \quad \text{or} \quad dw/d\theta = 0, \qquad [10]$$

and Eq. [7] becomes:

$$dP/d\theta = -(\rho f w^2/8)\,\phi \qquad [11]$$

Eq.  [11]  was  used  to  calculate  the  pressure  drop  in  the  control  volumes  between  jets  in  the  jet  ring 
distributor. 
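Because the velocity is constant between jets (Eq. [10]), Eq. [11] integrates to a simple product over the arc between two jets. The following Python sketch illustrates this under the assumption that the density and friction factor are also held constant over the arc; the function names are hypothetical.

```python
def geometric_factor(r1, r2, h):
    """Eq. [8]: geometric factor of the rectangular jet ring duct."""
    return (r2 + r1) / h + 2.0 * r2 / (r2 - r1)

def pressure_change_between_jets(rho, f, w, r1, r2, h, dtheta):
    """Eq. [11] integrated over an arc dtheta (radians) at constant rho, f and w.

    Returns P(theta + dtheta) - P(theta), which is negative (a frictional loss).
    """
    return -(rho * f * w**2 / 8.0) * geometric_factor(r1, r2, h) * dtheta
```

With 32 equally spaced jets, the arc between neighboring jets would be dtheta = 2π/32.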

The  pressure  change  due  to  removal  of  gas  from  the  jet  ring  via  the  jet  tubes  is  determined 
by  integrating  the  energy  balance,  Eq.  [1]: 

$$\rho w_2 h (R_2 - R_1)(w_2^2/2 + P_2/\rho) - \rho w_1 h (R_2 - R_1)(w_1^2/2 + P_1/\rho) + \rho_j u_j (\pi/4) d^2 (u_j^2/2 + P_j/\rho_j) = 0 \qquad [12]$$

where, for example, $P_1$ is the pressure upstream of the jet tube and $P_2$ is the pressure downstream of the jet
tube, the subscript “j” denotes flow conditions in the jet tube, and d is its diameter. Frictional
losses were ignored in this expression because they have a negligible effect on the pressure in
comparison to the relatively large pressure increase due to the removal of a fraction of the gas from
the jet ring, and because the spatial extent of the region is small in comparison to the
control volumes between jet tubes.

The  continuity  equation  yields: 

$$\rho w_2 h (R_2 - R_1) - \rho w_1 h (R_2 - R_1) + \rho_j u_j (\pi/4) d^2 = 0 \qquad [13]$$

Substituting this expression into Eq. [12] yields:

$$\rho w_2 (w_2^2/2 + P_2/\rho) - \rho w_1 (w_1^2/2 + P_1/\rho) + (w_1 - w_2) P_1 = 0 \qquad [14]$$

In this expression, the total specific energy of the fluid in the jet, $(u_j^2/2 + P_j/\rho_j)$, was equated to
$P_1/\rho$, since this represents the stagnation conditions for the jet flow, which is assumed to be
adiabatic (see below). Solving Eq. [14] for $P_2$:

$$P_2 = P_1 + \rho (w_1^3 - w_2^3)/(2 w_2) \qquad [15]$$
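As a quick numerical check, the pressure rise across a jet tube inlet can be evaluated and substituted back into the energy balance of Eq. [14]. The sketch below assumes the cubic form P2 = P1 + ρ(w1³ − w2³)/(2 w2) that follows from solving Eq. [14] for P2; the function names are hypothetical.

```python
def pressure_after_jet(p1, rho, w1, w2):
    """Eq. [15]: ring pressure downstream of a jet tube inlet (requires w2 > 0)."""
    return p1 + rho * (w1**3 - w2**3) / (2.0 * w2)

def eq14_residual(p1, p2, rho, w1, w2):
    """Left-hand side of Eq. [14]; it should vanish for a consistent P2."""
    return (rho * w2 * (w2**2 / 2.0 + p2 / rho)
            - rho * w1 * (w1**2 / 2.0 + p1 / rho)
            + (w1 - w2) * p1)
```

Since gas removal makes w2 smaller than w1, the function returns P2 greater than P1, which is the pressure recovery at each jet tube described in the text. The expression is singular at w2 = 0, so a sketch like this cannot be applied at a point where the ring velocity vanishes.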


The jets are all positioned at an angle of 20° with respect to the normal to the tangent of the jet
ring  circle.  Therefore,  the  gas  velocity  in  the  ring  has  a  component  along  the  jet  tube  axis  of 
w  cos(70°),  directed  towards  the  jet  tube  opening  in  one-half  of  the  ring,  and  away  from  the 
opening  in  the  other  half.  However,  in  comparison  to  the  static  pressure  in  the  jet  ring,  this  velocity 
head  is  quite  small,  and,  therefore,  was  neglected. 

Mass  Flowrate  in  the  Jet  Tubes.  It  was  assumed  that  the  flow  in  the  jet  tubes  is 
adiabatic  and  compressible  since  they  are  well  insulated  from  their  surroundings  and  the  reactor, 
and  the  thermal  conductivity  of  alumina  is  quite  low.  In  order  to  calculate  the  mass  flowrate 
through each tube, it was first assumed that choked (sonic) flow was attained at the tube exit ($M_e$ =
1). In this case, the Mach number at the tube inlet, M, is given by the expression [20]:

$$(1 - M^2)/(kM^2) + [(k+1)/(2k)] \ln[(k+1)M^2/(2 + (k-1)M^2)] + (1 - kM^2)/(kM^2) = f_j L/d, \qquad [16]$$

where k is the heat capacity ratio of the gas, L is the jet tube length, d is its diameter, and $f_j$ is the
average friction factor for flow in the jet tubes. The latter was determined from [21]:

$$f_j = 0.0264\,(T_o/P_o)^{0.23} + 0.45\, d/(2L), \qquad [17]$$

where $T_o$ and $P_o$ are the temperature and pressure in the jet ring. This correlation for the friction
factor has been experimentally verified for the jets used in the jet ring distributor [22].

Eq. [16] was solved using Newton’s method, updating the average friction factor, $f_j$, at each
iteration. The ratio of the pressure at this point in the tube, P, to the exit pressure, $P_e$, was then
determined from [23]:

$$P/P_e = (1/M)\,[(k+1)/(2 + (k-1)M^2)]^{1/2} \qquad [18]$$

This ratio was then compared to the ratio $P/P_R$, where $P_R$ is the reactor pressure, which was
approximately atmospheric. Choked flow at the jet tube exit was confirmed by checking for
$P_e \ge P_R$.
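The choked-flow test can be illustrated with a short Python sketch. Two assumptions are made here and should be read as such: the inlet Mach number is found from the standard textbook Fanno-line relation for a Darcy friction factor, (1 − M²)/(kM²) + [(k+1)/(2k)] ln[(k+1)M²/(2 + (k−1)M²)] = fL/d, which may not match every term of Eq. [16] as printed, and bisection is used in place of the Newton iteration described in the text. The friction factor is treated as a fixed input rather than updated at each iteration, and all function names are hypothetical.

```python
import math

def fanno_length(M, k):
    """f*L/d needed to accelerate adiabatic duct flow from Mach M to Mach 1."""
    return ((1.0 - M**2) / (k * M**2)
            + (k + 1.0) / (2.0 * k)
            * math.log((k + 1.0) * M**2 / (2.0 + (k - 1.0) * M**2)))

def choked_inlet_mach(fL_over_d, k, tol=1e-12):
    """Subsonic inlet Mach number giving sonic flow at the exit (bisection)."""
    lo, hi = 1e-6, 1.0          # fanno_length falls monotonically to 0 at M = 1
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if fanno_length(mid, k) > fL_over_d:
            lo = mid            # tube is too short for this M: inlet Mach is higher
        else:
            hi = mid
    return 0.5 * (lo + hi)

def pressure_ratio_to_sonic_exit(M, k):
    """Eq. [18]: inlet-to-exit static pressure ratio for a sonic exit."""
    return (1.0 / M) * math.sqrt((k + 1.0) / (2.0 + (k - 1.0) * M**2))

def is_choked(p_inlet, M, k, p_reactor):
    """Choked if the sonic exit pressure is at least the reactor pressure."""
    p_exit = p_inlet / pressure_ratio_to_sonic_exit(M, k)
    return p_exit >= p_reactor
```

If `is_choked` returns False, the sonic-exit assumption fails and the subsonic procedure described next applies.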

If $P_e$ is less than the reactor pressure, then the flow at the jet tube exit is subsonic. In this case, the correct Mach
numbers at the tube inlet and exit must be found. This was done iteratively, starting from
the assumption of sonic flow at the jet tube exit ($M_e$ = 1) and the inlet Mach number, M, determined
from the original choked flow calculation. These values were used to determine the Mach number
at the jet tube exit by solving the following transcendental expression using Newton’s method [20]:

$$P/P_e = (M_e/M)\,[(2 + (k-1)M_e^2)/(2 + (k-1)M^2)]^{1/2} \qquad [19]$$

The resultant value of $M_e$ was then used in the following expression for adiabatic, compressible
flow in a duct of constant cross section [20] to yield an updated value for the Mach number at the
jet tube inlet, M:

$$1/(2M^2) - [(k+1)/2] \ln(M) + [(k+1)/4] \ln[1 + (k-1)M^2/2] = k f_j x/d + C, \qquad [20]$$

where C is a constant of integration evaluated at the jet tube exit (M = $M_e$, x = L). The resultant
value of M (x = 0) was then compared to the initially assumed value, and the entire process
involving Eqns. [19] and [20] was iterated until the assumed and calculated values of the Mach
number at the jet tube inlet, M, agreed. This calculation procedure also yielded the corresponding
value of the exit Mach number and the average friction factor, $f_j$, both of which are updated
during the course of the iterative calculation.
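For a given inlet Mach number and pressure ratio, Eq. [19] can also be solved for the exit Mach number without iteration, because squaring it gives a quadratic in the square of the exit Mach number. The Python sketch below uses that closed form instead of the Newton iteration described in the text; it is an alternative shown for illustration only, and the function names are hypothetical.

```python
import math

def pressure_ratio(M, Me, k):
    """Eq. [19]: inlet-to-exit static pressure ratio for adiabatic duct flow."""
    return (Me / M) * math.sqrt((2.0 + (k - 1.0) * Me**2)
                                / (2.0 + (k - 1.0) * M**2))

def exit_mach(M, p_ratio, k):
    """Exit Mach number from Eq. [19], given inlet Mach M and p_ratio = P/P_e.

    Squaring Eq. [19] gives (k-1)*y**2 + 2*y - b = 0 with y = Me**2.
    """
    b = p_ratio**2 * M**2 * (2.0 + (k - 1.0) * M**2)
    y = (-1.0 + math.sqrt(1.0 + (k - 1.0) * b)) / (k - 1.0)  # positive root
    return math.sqrt(y)
```

In the iterative scheme of the text, this value of the exit Mach number would then be passed to Eq. [20] to update the inlet Mach number.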

The  resultant  Mach  number  at  the  jet  tube  inlet  was  used  to  determine  the  gas  density  at  this 
point  from  the  isentropic  flow  relations.  The  jet  tube  mass  flowrate  is  then  given  by: 

$$(dm/dt)_j = \rho\, M\, (k R T_e)^{1/2}\, (\pi/4)\, d^2 \qquad [21]$$

Calculation  Procedure.  Application  of  the  preceding  formulations  was  performed  in  the 
following  manner  to  obtain  the  pressure  and  velocity  distributions  in  the  jet  ring. 

(1) From the gas flowrate in the inlet feed tube, the value of the inlet velocity in one-half of
the jet ring, $w_{in}$, is determined from continuity:

$$A_f v_f = 2 A_{ring} w_{in} \qquad [22]$$

where $A_f$ and $A_{ring}$ are the cross sectional areas of the inlet feed tube and jet ring, respectively,
and $v_f$ is the gas velocity in the inlet feed tube. The pressure at the entrance to either half of the jet
ring is given by the Bernoulli equation:

$$P_{in} = P_f + \rho (v_f^2 - w_{in}^2)/2, \qquad [23]$$

where $P_f$ is the pressure in the feed tube (5/16 in. I.D.).

(2) The calculation then proceeds around the ring in one direction using Eq. [11] to determine
the pressure at the end of each control volume without a jet tube, and Eq. [15] for the pressure
change across each jet tube inlet.

(3) The flow rate through each jet tube, $(dm/dt)_j$, is then calculated from Eq. [21] as
described above.

(4) For each of the control volumes containing a single jet tube, the exit velocity from this
control volume in the jet ring, $w_2$, is determined from the continuity equation, Eq. [13]:

$$w_2 = w_1 - (dm/dt)_j/[\rho h (R_2 - R_1)] \qquad [24]$$

(5)  The  sum  of  the  flow  rates  through  all  the  jet  tubes  is  compared  to  the  total  flow  rate,  and 
the  pressure  in  the  feed  inlet  tube  is  adjusted  until  these  agree.  At  this  point  the  velocity  and 
pressure  at  the  inlet  of  the  jet  ring  will  also  agree. 

The preceding calculation procedure was implemented on an EXCEL™ spreadsheet using a
macro function for the flow through the jet tubes (sonic or subsonic).
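Steps (2) through (4) of the procedure can be sketched as a simple marching loop. The Python below is a simplified illustration, not the spreadsheet implementation: `jet_flow` is a hypothetical callable standing in for the jet tube calculation of Eqs. [16]-[21], the friction factor is held constant, and the outer adjustment of the feed pressure in step (5) is left out.

```python
def march_half_ring(w_in, p_in, rho, f, phi, dtheta, area, jet_flow, n_jets=16):
    """March through n_jets control volumes in one half of the jet ring.

    w_in, p_in : velocity and pressure entering the half ring (step 1)
    phi        : geometric factor of Eq. [8]
    area       : ring cross-sectional area, h*(R2 - R1)
    jet_flow   : hypothetical callable, jet mass flowrate for a ring pressure
    Returns lists of ring pressures, ring velocities and jet flowrates.
    """
    w, p = w_in, p_in
    pressures, velocities, flows = [], [], []
    for _ in range(n_jets):
        # Step (2): frictional pressure loss over the arc between jets, Eq. [11]
        p -= (rho * f * w**2 / 8.0) * phi * dtheta
        # Step (3): mass flowrate through the jet tube at the local pressure
        mdot = jet_flow(p)
        # Step (4): velocity leaving the control volume, Eq. [24]
        w_next = w - mdot / (rho * area)
        if w_next <= 0.0:
            raise ValueError("ring velocity reached zero; Eq. [15] is singular")
        # Step (2): pressure recovery across the jet tube inlet, Eq. [15]
        p += rho * (w**3 - w_next**3) / (2.0 * w_next)
        w = w_next
        pressures.append(p)
        velocities.append(w)
        flows.append(mdot)
    return pressures, velocities, flows
```

Step (5) would wrap this loop in an outer iteration that adjusts the feed pressure until the summed jet flowrates match the total feed flowrate. How the last control volume, where the ring velocity approaches zero, is treated is not stated in the text, so this sketch simply stops with an error there.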

Jet Ring Distribution Results. Figure 11 presents the resultant pressure and velocity
distributions for the case of 390 slm of air and 28.7 slm of methane (φ = 0.7), characteristic of the
5  ms  residence  time  combustion  runs  that  were  conducted  in  the  WSR.  For  these  conditions,  sonic 
flow  was  attained  at  the  exits  of  the  jet  tubes.  (For  the  200  slm  and  150  slm  runs,  the  flow  was 
subsonic  in  the  jet  tubes.)  As  shown,  the  gas  velocity  in  the  jet  ring  decreases  discretely  at  each  jet 
tube,  corresponding  to  the  amount  of  gas  which  flows  into  the  reactor  at  that  point,  as  determined 
by  continuity.  The  pressure  in  the  jet  ring  decreases  due  to  wall  friction,  and  increases  due  to  the 
momentum  loss  at  each  jet  tube. 

Figure 11. Pressure and velocity distributions in the jet ring for a flowrate of 390 slm air and 28.7 slm methane (φ
= 0.7). The jet ring temperature was assumed constant at 300°C. Sonic flow conditions occurred in all
the jet tubes.

Figure 12. Mass flowrate distribution corresponding to the calculation presented in Figure 11.

The  overall  effect  is  to  progressively  increase  the  jet  ring  pressure  until  about  one-half  of  the  way 
around  the  ring,  whereupon  it  then  begins  to  decrease  to  match  the  inlet  pressure  at  the  end  of  the 
ring. As shown in Figure 11, the pressure distribution is symmetric with respect to the two halves
of  the  jet  ring.  This  is  also  reflected  in  the  flow  distribution  through  the  jet  tubes,  as  shown  in 
Figure  12.  Although  the  flowrate  increases  progressively  to  the  midpoint  of  the  jet  ring,  it  is  noted 
that  there  is  negligible  flow  maldistribution,  with  the  flowrate  differing  by  less  than  1%  from  the 
minimum  to  the  maximum;  i.e.,  practically  equal  amounts  of  gas  flow  through  each  jet  tube. 

The  corresponding  results  for  the  other  two  principal  flow  conditions  that  were  used  in  the 
current  work  are  qualitatively  similar,  with  the  model  calculation  agreeing  quite  well  with  the 
experimental  conditions.  The  largest  deviation  between  the  model  and  the  data  occurred  for  the  150 
slm  case.  There  are  a  number  of  possible  reasons  for  the  minor  discrepancies  noted.  First  of  all, 
the  pressure  in  the  jet  ring  was  not  measured  directly;  it  was  estimated  from  the  pressure  just 
upstream  of  the  feed  inlet  tube  at  the  “vaporizer.”  Another  problem  was  that  the  average 
temperatures  of  the  jet  tubes  were  probably  greater  than  the  measured  jet  ring  temperature.  Also, 
the  latter  temperature  generally  tended  to  increase  steadily  during  a  run.  The  value  cited  was  simply 
an  average  over  a  period  of  time. 

Contrary to the results of the preceding calculations, tracer tests conducted on a room
temperature jet ring that had been removed from the reactor revealed significant fluid
maldistribution. An example of these data is presented in Figure 13, which is a compilation of
tracer responses from selected individual jets. For perfect distribution, the responses of “mirror
image” jet pairs from each half of the jet ring should be exactly the same. As shown, however,
although the responses of the jet pairs superimpose quite well near the feed tube inlet (e.g., #1-32),
they progressively separate as the flow proceeds through the ring, such that the response for #20 is
much slower than for #12, and it is even slower than for #16. This indicates that the flow is quite
asymmetric, with the zero velocity point occurring beyond π radians, and with more gas flowing
through the jets located from π to 2π than through those between 0 and π. However, this jet ring
differed from the one that was used in the combustor for all the current experiments. It was an
older one fitted with all stainless steel jet tubes that had already experienced significant service in
the combustor. Some of the tubes were obviously worn, and some were noncircular and partially
“crimped” at the exit. Nine of the jet tubes (#1, 4, 8, 12, 16, 20, 24, 28, and 32) were replaced
with alumina tubes of the same type used in the reactor for the combustion runs, but the remaining
23 were the original stainless steel tubes. Consequently, it is hypothesized that the fluid
maldistribution resulted from jet tubes which had considerably lower conductances than the others.


Figure  13.  Tracer  (50  ms  CF3Br  pulses  at  room  temperature)  response  of  selected  jets  in  the  jet  ring  at  a  flowrate  of 
200  slm  of  air.  The  jets  are  counted  counterclockwise  from  the  inlet  feed  tube.  “Mirror-image”  jet  pairs: 
1-32, 4-28, 8-24, 12-20; 16 is located at θ = 31π/32.

In order to demonstrate the effect of variations in jet tube conductance, the conductances of
tubes 1-16 were arbitrarily reduced to 75% of capacity in the model for the 390 slm case. The
results  of  this  calculation  are  presented  in  Figures  14  and  15.  As  shown  in  Figure  14,  the  pressure 
and  velocity  distributions  are  clearly  asymmetric,  and  the  pressure  in  the  feed  tube  had  to  be 
increased  in  order  to  maintain  the  measured  flowrate.  Figure  15  shows  the  reduction  in  flow 
through  tubes  1-16,  and  the  considerably  larger  flow  that  was  forced  to  occur  through  tubes  17-32. 

Figure 14. Pressure and velocity distributions in the jet ring for a flowrate of 390 slm air and 28.7 slm methane (φ
= 0.7), with jets 1-16 operating at 75% of capacity. The resultant inlet feed tube pressure is 265.5
kPa.

Figure 15. Mass flowrate distribution corresponding to the calculation presented in Figure 14.


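Response to an Inert Tracer. A tracer mass balance on a jet ring control volume of length
$R_2 \Delta\theta$ that includes a single jet (assumed sufficiently small to be approximated as
well-mixed) is given by:

$$dC/dt = \{w_{in} C_{in} - w_{out} C - (dm/dt)_j\, C/[\rho h (R_2 - R_1)]\}/(R_2 \Delta\theta) \qquad [25]$$

where $w_{in}$ and $w_{out}$ are the gas velocities entering and leaving the control volume.
Substituting for $(dm/dt)_j/[\rho h (R_2 - R_1)]$ from Eq. [24]:

$$dC/dt = [w_{in} (C_{in} - C)]/(R_2 \Delta\theta), \qquad [26]$$

or, in finite difference form:

$$C_{t+\Delta t} = [C_t + (\Delta t/(R_2 \Delta\theta))\, w_{in} C_{in}]/[1 + (\Delta t/(R_2 \Delta\theta))\, w_{in}] \qquad [27]$$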

The solution to Eq. [27] will give the transient tracer response for each of the 32 jets. Although this
solution inherently assumes well-mixed regions between jets, overall the 32 regions in series will
behave much like a plug flow system.
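The finite difference update of Eq. [27] amounts to an implicit (backward) step for each well-mixed region, applied in sequence along the ring. The Python sketch below illustrates this for one half of the ring. It assumes that the inlet concentration of each region is the freshly updated concentration of the region upstream, which is one possible reading of Eq. [27]; the function names and the list-based data layout are hypothetical.

```python
def step_cell(c_old, c_in, w_in, r2, dtheta, dt):
    """Eq. [27]: implicit update of the tracer concentration in one region."""
    a = dt * w_in / (r2 * dtheta)
    return (c_old + a * c_in) / (1.0 + a)

def ring_response(pulse, w, r2, dtheta, dt, n_cells=16):
    """Tracer history for a series of well-mixed regions in one half ring.

    pulse : feed concentration at each time step (e.g. a square pulse)
    w     : inlet velocity of each region (decreasing along the ring)
    Returns history[t][i], the concentration in region i after step t.
    """
    c = [0.0] * n_cells
    history = []
    for c_feed in pulse:
        upstream = c_feed
        new_c = []
        for i in range(n_cells):
            ci = step_cell(c[i], upstream, w[i], r2, dtheta, dt)
            new_c.append(ci)
            upstream = ci   # assumption: region i+1 sees the updated value
        c = new_c
        history.append(c)
    return history
```

A square pulse is represented by a list such as `[1.0] * n_on + [0.0] * n_off`; each region then responds later and more smoothly than the one before it, which is the increasing time delay described in the text.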

The  concentration  of  tracer  in  the  total  effluent  from  the  WSR,  C(t),  is  then  given  by: 

$$V_{WSR}\, dC/dt = \sum\{(dm/dt)_j\, C_j/\rho\} - C\, Q_{WSR} \qquad [28]$$

where the summation is over all (32) jets, $C_j$ is the tracer concentration as a function of time at the
source point in the jet ring for each jet tube, determined from Eq. [26], and $V_{WSR}$ and $Q_{WSR}$ are
the volume and total effluent volumetric flowrate of the WSR. In finite difference form, Eq. [28]
becomes:

$$C_{t+\Delta t} = [\Delta t \sum\{(dm/dt)_j\, C_j/\rho\}/V_{WSR} + C_t]/[1 + \Delta t\,(Q/V)_{WSR}] \qquad [29]$$
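The well-stirred reactor update can be sketched in the same way. The Python function below is a hypothetical illustration of the implicit update C(t+Δt) = [Δt Σ{(dm/dt)_j C_j/ρ}/V + C(t)]/[1 + Δt Q/V], which follows from applying a backward difference to the reactor tracer balance; the argument names are assumptions.

```python
def wsr_step(c_old, jet_flows, jet_concs, rho, volume, q_out, dt):
    """One implicit time step of the WSR tracer balance.

    jet_flows : mass flowrate of each jet, (dm/dt)_j
    jet_concs : tracer concentration delivered by each jet at this time step
    q_out     : total effluent volumetric flowrate of the reactor
    """
    source = sum(m * c / rho for m, c in zip(jet_flows, jet_concs))
    return (dt * source / volume + c_old) / (1.0 + dt * q_out / volume)
```

Calling `wsr_step` once per time step, with the jet concentrations taken from the jet ring calculation, gives the tracer concentration in the reactor effluent.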

A numerical scheme to calculate the transient response of the jet ring according to Eqns. [27]
and [29] was implemented in the form of an EXCEL™ spreadsheet. Figure 16 presents tracer
responses calculated with this model for the 200 slm case. Since the pulsed valve has a response
time of ~0.5 ms, the input to the jet ring is the square pulse depicted in this figure. Although the jet
ring distributor does not mix the gas per se, it does introduce increasing time delays to the gas
emanating from each jet as the gas flows around the ring from the feed tube connection to the jets
progressively further away. Consequently, gas distribution through the 32 jets in this
manner broadens the distribution of residence times. This can be clearly seen from
the curve of the tracer flowrate from the jet ring into the WSR (dashed curve), which is already
considerably broadened in comparison to the square pulse input introduced in the feed tube. This
already broadened distribution is then broadened further by mixing in the WSR to give the final
pulse in the effluent from the WSR presented in Figure 16. However, from this figure it is quite
evident that the jet ring distributor causes most of the resultant broadening in comparison to the
contribution of the WSR.


Figure 16. Calculated tracer response from the model for a total flowrate of 200 slm air and 14.7 slm methane (φ =
0.7) and a pulse of 50 ms duration.

Figure 17 presents the calculated tracer responses at selected individual jet tubes
corresponding to the same conditions depicted in Figure 16. In this figure the solid curves are the
responses from jet numbers 1, 4, 8, 12, and 16, and the dashed curves are the corresponding
responses from jets 20, 24, 28, and 32. The effect of increasing time delay as the gas proceeds
around the ring to jets progressively further away is clearly shown. In addition, since the
responses for the “mirror image” jet pairs, 1:32, 4:28, 8:24, and 12:20, are coincident, this
demonstrates that the distribution for one-half of the ring is perfectly symmetric with respect to the
other half of the ring.


Figure 17. Calculated tracer response from the jet ring distributor model for individual jet tubes for a total flowrate
of 200 slm air and 14.7 slm methane (φ = 0.7) and a pulse of 50 ms duration.

Figure  18  presents  inert  tracer  response  predictions  calculated  with  the  jet  ring  distributor 
model  compared  to  actual  krypton  tracer  data  of  varying  pulse  duration  from  10  to  200  ms.  As  can 
be  seen  from  the  figure,  the  actual  data  exhibit  a  somewhat  more  “sluggish”  response  than 
predicted  by  the  model;  i.e.,  the  model  rises  and  falls  at  a  somewhat  faster  rate  than  the 
experimental data. This is most noticeable with the 50 ms pulse, for which the model predicts that
the pulse attains its maximum krypton concentration, while the data indicate that it does not.
Apparently,  the  nominal  time  response  characteristics  that  were  assumed  for  the  pulsed  valve  did 
not  correspond  to  the  actual  time  response  of  the  valve  for  the  manner  in  which  it  was  operated. 
For  the  10,  100,  150,  and  200  ms  pulses,  however,  there  is  reasonable  agreement  between  the 
model  and  the  data.  In  any  case,  this  does  not  represent  a  significant  problem,  since  the  pulsed 
valve  time  response  characteristics  can  be  modified  to  match  the  experimental  data,  thereby  fixing 
the  actual  valve  behavior. 

Figure 18. Krypton signal responses for variable duration pulses (in milliseconds) and the corresponding jet ring
model predictions for the ignited WSR operating with 200 slm air and 14.7 slm methane.

CHEMKIN-Based  Model  Elements 

The first step in formulating a model of the combustor system was to develop a complete
chemical kinetics input file from published chemical reaction mechanisms. Specifically, this file
contains the combustion mechanism of the reactions of methane in air, plus all the additional reactions required
for the interaction of Halon 1301 with the gas milieu. This was accomplished with a combination
of Konnov's methane combustion mechanism [14], NIST's CKMech hydrofluorocarbon and
Halon 1301 mechanisms [15], and the Halon 1301 mechanism of Battin-Leclerc et al. [3]. In
addition, species identified from the mass spectrometric data (e.g., CF3OF, CF3OH, etc.), which
were not included in any of the published mechanisms, were added to the mechanism.
Thermodynamic data and rate constants for these species were found in the literature [16].

The  resultant  chemical  kinetic  mechanism  is  the  input  to  the  PSR  module  of  the  CHEMKIN 
II  code.  However,  the  PSR  module,  as  originally  written,  employs  Newton’s  method  to  solve  the 
steady-state  species  mass  balances  and  the  energy  equation.  Furthermore,  the  equilibrium 
composition  of  the  species  at  the  reactor  temperature  is  used  as  a  starting  estimate  for  the  iteration 
[13]. Thus, in order for the program to serve the requirements of the current model, these aspects
of the PSR module had to be modified.

One  modification  involved  replacing  the  Newton's  method  solution  of  the  steady-state 
species  mass  balances  with  a  solution  of  the  time-dependent  differential  equation  mass  balances, 
which  describe  the  evolution  to  the  steady-state  solution.  This  was  accomplished  by  bypassing  the 
solver  call  to  the  Newton's  method  subroutine,  and  replacing  it  with  a  call  to  TWOPNT,  which  is  a 
subroutine  that  time-steps  using  the  backward-Euler  method.  In  this  method,  the  time  derivatives 
are  approximated  by  finite  differences.  Once  this  was  accomplished,  a  readable  output  file  was 
created  in  spreadsheet  form  to  report  the  result  of  each  time-step. 
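The idea of time-stepping a reactor balance to its steady state with the backward-Euler method can be illustrated on a deliberately simple case. The Python sketch below is not the PSR code: it treats a single species with a hypothetical first-order loss in a stirred reactor, dY/dt = (Y_in − Y)/τ − k_rxn·Y, for which the implicit update can be written in closed form. In the actual PSR module, each implicit step involves a coupled nonlinear system for all species.

```python
def backward_euler_cstr(y0, y_in, tau, k_rxn, dt, n_steps):
    """Backward-Euler time stepping of dY/dt = (y_in - Y)/tau - k_rxn*Y.

    The implicit equation (Y_new - Y_old)/dt = (y_in - Y_new)/tau - k_rxn*Y_new
    is linear in Y_new, so it is solved directly at every step.
    Returns the list of Y values, one entry per time step (plus the start).
    """
    y = y0
    history = [y]
    for _ in range(n_steps):
        y = (y + dt * y_in / tau) / (1.0 + dt * (1.0 / tau + k_rxn))
        history.append(y)
    return history
```

The sequence converges to the steady-state value y_in/(1 + k_rxn·τ) for any positive time step, which is the stability property that makes an implicit method attractive for stiff kinetics. Writing each entry of the returned list to a file mirrors the per-time-step output described above.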

The transient Halon 1301 pulses were introduced by recalculating the input composition after
each time step. In order to accurately model the input pulse of fire suppressant, a jet ring
distributor model (described below) was developed. The pulse shapes determined with this jet ring
model were fit to rising and falling exponentials using Microsoft's Curve Expert software, and
these fits were used as inputs to the PSR code. When the program is run, an updated
input percentage of the fire suppressant is calculated after each time step using the pulse shape fits,
and the input concentrations of all other species are adjusted accordingly.
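A pulse described by rising and falling exponentials, and the corresponding adjustment of the inlet composition, might look like the following Python sketch. The functional form, the time constants, and the proportional rescaling of the other species are all assumptions made for illustration; the fitted parameters and the exact adjustment rule used with the PSR code are not given in the text.

```python
import math

def pulse_fraction(t, peak, t_on, t_off, tau_rise, tau_fall):
    """Suppressant mole fraction at time t for an exponential rise and decay.

    peak is the plateau the rise approaches; tau_rise and tau_fall are
    hypothetical fitted time constants.
    """
    if t < t_on:
        return 0.0
    if t < t_off:
        return peak * (1.0 - math.exp(-(t - t_on) / tau_rise))
    level = peak * (1.0 - math.exp(-(t_off - t_on) / tau_rise))
    return level * math.exp(-(t - t_off) / tau_fall)

def inlet_composition(base, agent, x_agent):
    """Rescale the base mole fractions so the total remains 1 with the agent added.

    Assumption: all other species are diluted in proportion, (1 - x_agent).
    """
    scaled = {species: x * (1.0 - x_agent) for species, x in base.items()}
    scaled[agent] = x_agent
    return scaled
```

For example, a base mixture of 200 slm air (taken here as 21% O2 and 79% N2) and 14.7 slm methane corresponds to roughly `{"CH4": 0.0685, "O2": 0.1956, "N2": 0.7359}`, and `inlet_composition(base, "CF3Br", pulse_fraction(t, 0.0138, ...))` would give the inlet composition at time t for a 1.38% pulse.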

The  final  necessary  modification  was  to  allow  for  a  starting  estimate  of  the  solutions  other 
than  thermodynamic  equilibrium.  Ideally,  the  model  would  run  by  starting  the  time  stepping  with  a 
reactor  composition  equal  to  the  results  of  steady-state  methane  combustion  at  the  reactor 
conditions.  This  modification  was  achieved  by  running  the  PSR  program  in  its  original  unmodified 
form using the methane combustion mechanism with no input Halon 1301 pulse, and retaining the
steady-state solution calculated by the program. This steady-state solution was then used as a
starting estimate for the modified transient PSR calculation by utilizing the XEST command, which
allows for user-supplied starting estimates.

A  listing  of  the  principal  subroutines  and  their  relationships  is  presented  in  the  Appendix. 

Implementation of all the preceding modifications results in a robust method for the calculation
of the WSR composition as a function of time for virtually any set of reactor conditions. The model
was run with Halon 1301 input pulses at a variety of fuel-to-air ratios, reactor temperatures, pulse
intensities, and flow rates. The results appeared to be qualitatively consistent with the
corresponding data, and showed no signs of spurious or chaotic behavior as the variables were
varied. Sample characteristic output data for Br2, O2, HF, and CH4 are presented in Figures 19-23.
In particular, it is noted that the model results for the behavior of methane are in agreement
with the findings of Battin-Leclerc et al. [3]. For example, in these latter studies it was found that
at a temperature of 1070 K, for space times (τ = WSR Volume/Feed Flow Rate) less than 1.5 s,
Halon 1301 actually increased methane conversion. The transient data in Figure 23 support this
behavior by showing a net increase in methane conversion from the steady-state value at 1773.15 K
when Halon 1301 is pulsed in at a WSR space time of τ = 0.07 s. In Figure 22, transient model
results for τ = 10.5 ms show that the net methane conversion at 1773.15 K does actually decrease
after an initial 9 seconds of increased methane conversion. These results demonstrate that the
resultant model is consistent with others found in the literature.

The  model  results  have  still  not  been  exhaustively  compared  to  the  mass  spectrometric  data; 
this  effort  is  currently  ongoing.  Calibration  and  fragmentation  data  are  available  to  facilitate  this 
direct comparison. For example, mass spectra for most of the species have been obtained from the
NIST Chemistry WebBook [16]. Examples of mass spectrometric results predicted by the PSR
model  are  presented  in  Figures  24  and  25. 

However,  the  PSR  model  results  per  se  do  not  include  the  effects  of  the  sampling  line  from 
the  WSR  combustor  to  the  mass  spectrometer.  The  effects  of  any  additional  reactions  and  radical 
terminations  on  the  tube  walls  on  the  resultant  gas  phase  composition  are  not  taken  into  account.  In 
order  to  model  the  processes  which  occur  in  the  sampling  line,  the  constant  pressure  version  of  the 
SENKIN  module  was  applied  by  adapting  an  approach  that  has  previously  been  used  to  model  the 
variation  in  stratospheric  gas  composition  caused  by  sampling  through  a  tube  in  laminar  flow  [17]. 
The  gas  sampling  rate,  at  8-10  slm,  was  calculated  to  be  well  within  the  laminar  flow  regime,  with 
an  average  Reynolds  number  of  about  260.  The  residence  time  in  the  sampling  tube  was 
approximately  15  ms,  which  is  of  the  same  order  of  magnitude  as  the  duration  of  the  Halon  1301 
pulses.  The  gas  composition  from  the  WSR,  determined  from  the  transient  PSR  module 
calculations,  along  with  the  diffusion  coefficients  for  the  radical  species  (estimated  using  a 
technique  given  by  Fuller  [18]),  and  the  sampling  line  temperature  and  pressure,  are  the  inputs  to 
the  sampling  line  model. 

The  SENKIN  module  was  modified  to  enable  it  to  read  input  data  directly  from  the  PSR 
program,  thereby  eliminating  the  necessity  of  manually  transferring  composition  data.  The 
SENKIN  module  then  time-steps  through  the  entire  length  of  the  sampling  tube  (~15  ms)  for  each 
time-step  composition  that  is  reported  by  the  PSR  program.  This  final  element  of  the  overall 
transient  model  has  been  completed  and  is  currently  being  interfaced  to  the  PSR  module. 

Figure 19. Transient behavior of Br2 calculated from the PSR model for a 1.38% Halon 1301 pulse of 50 ms
duration into an ignited WSR operating at 1773.15 K with 200 slm air and 14.7 slm methane.

Figure 20. Transient behavior of O2 calculated from the PSR model for a 1.38% Halon 1301 pulse of 50 ms
duration into an ignited WSR operating at 1773.15 K with 200 slm air and 14.7 slm methane.

Figure 21. Transient behavior of HF calculated from the PSR model for a 1.38% Halon 1301 pulse of 50 ms
duration into an ignited WSR operating at 1773.15 K with 200 slm air and 14.7 slm methane.

Figure 22. Transient behavior of CH4 calculated from the PSR model for a 1.38% Halon 1301 pulse of 50 ms
duration into an ignited WSR operating at 1773.15 K with 200 slm air and 14.7 slm methane.

Figure 23. Transient behavior of CH4 calculated from the PSR model for a 1.38% Halon 1301 pulse of 50 ms
duration into an ignited WSR operating at 1773.15 K with 1.35 slm air and 0.1 slm methane.

Figure 24. Calculated signal responses to a 1.38% Halon 1301 pulse of 50 ms duration into an ignited WSR
operating at 1773.15 K with 200 slm air and 14.7 slm methane.

Figure 25. Modeled and calculated signal responses to a 1.38% Halon 1301 pulse of 50 ms duration into an ignited
WSR operating at 1773.15 K with 200 slm air and 14.7 slm methane.

Summary  and  Conclusions 

The  WSR/REMS  apparatus  has  been  demonstrated  to  be  capable  of  monitoring  both  the 
steady-state  and  dynamic  behavior  of  the  WSR  combustor. 

It  was  concluded  that  the  three  fire  suppressants  exhibit  certain  similarities  in  their  behavior  in 
combustion  environments,  as  well  as  some  distinct  differences.  The  major  product  species 
produced by their injection into the WSR combustor are HF (for all three), HBr (for CF3Br), and
HI (for CF3I). All three fire suppressant agents also produced significant mass peaks at 47, 66,
and 85, which are attributed to the ion series CFO+, CF2O+, and CF3O+, respectively. CF2O+ is
most  probably  the  parent  ion  peak  of  the  stable  species,  carbonyl  fluoride.  High  levels  of  this  very 
toxic  gas  were  produced  at  short  reactor  residence  times.  Masses  47  and  85,  CFO+  and  CF3O+, 
are  most  probably  fragment  ions  from  stable  neutral  species  such  as  CF3OF  and  CF3OH.  These 
latter  species  should  be  included  in  fire  suppressant  reaction  mechanisms. 

A  numerical  model  of  the  WSR  combustor  system  has  been  developed  to  interpret  the 
resultant transient responses of gas phase species measured mass spectrometrically. This model
includes the effects of the residence time distribution of the jet ring distributor, as determined from
a fluid mechanical model developed for this purpose, and the detailed kinetics in the well-mixed
combustor and the sampling line to the mass spectrometer. The latter two components of the model
are based on the CHEMKIN II package of codes, especially the perfectly-stirred reactor (PSR) and
SENKIN  modules.  Work  has  also  been  done  on  transforming  the  mole  fractions  reported  by  the 
model  into  corresponding  mass  spectrometric  signals.  Direct  comparison  of  model  results  with  the 
combustion  data  has  been  initiated. 

It  is  anticipated  that  the  transient  data  coupled  with  the  model  analysis  will  result  in  the 
identification  of  the  key  reactions  in  the  mechanisms  responsible  for  the  behavior  of  fire 
suppressants  in  combustion  systems.  This  approach  will  also  be  used  to  determine  kinetic  rates  for 
the  reactions  involved.  This  effort  is  expected  to  improve  the  understanding  of  fire  suppression 
mechanisms. 

The unique capabilities of the REMS/WSR apparatus could be used to identify and measure
in  a  definitive,  quantitative  manner  the  efficacy  of  candidate  fire  suppressants  using  only  minimal 
pulsed  amounts,  as  well  as  aid  in  the  assessment  of  the  important  environmental  effects  of  using 
fire  suppressants  in  combustion  systems  relevant  to  Air  Force  operations. 
Appendix

Description of Files Comprising the Model

FORTRAN Code:

CKInterp39.for - CHEMKIN code that reads input mechanisms and formats them to be used for further computations.
LatestPSR24.for - Code that reads in the parameters of the reactor and feed stream, and calculates the composition within the PSR as a function of time.
LatestTwopntllD.for - Computes the time-stepped solutions to the differential equations formulated by the PSR model.
PSRDriver.for - Apportions the work spaces for the PSR model and opens all necessary PSR Input/Output files.
Senkin.for - Apportions the work spaces for the sampling line model and calculates the composition at the end of the sampling line, from the output from the PSR model.
CKStrt.h - File required for CKLib43.
CKLib43.for - Subroutine library.
Dasac.for - Subroutine library.
Eqlib.for - Subroutine library.
Math.for - Subroutine library.
Stanlib.for - Subroutine library.
Vode.for - Subroutine library.

Input/Output Files:

CHEMKIN Interpreter (PSR):

chem.inp - User-supplied input file containing the necessary elements, species, thermodynamic data, and chemical reactions for the PSR model.
chem.out - CHEMKIN output file, displaying the input and reporting any error messages.
therm.dat - Input file containing the thermodynamic properties of all the species included in the reaction mechanism.
PSRchem.bin - Output file opened and read by the PSR module.

CHEMKIN Interpreter (SENKIN):

SENchem.inp - User-supplied input file containing the necessary elements, species, thermodynamic data, and chemical reactions for the sampling line model.
SENchem.out - CHEMKIN output file, displaying the input and reporting any error messages.
therm.dat - Input file containing the thermodynamic properties of all the species included in the reaction mechanism.
SENchem.bin - Output file opened and read by the SENKIN module.

PSR Module:

PSR.inp - User-supplied input file containing the reactor parameters.
PSR.out - PSR output file, containing the results of the PSR computations.
PSRrest.bin - PSR output file, containing the final solution for use as a starting point for further computations.
PSRrecov.bin - PSR output file, which is rewound and rewritten after each successful series of time steps.
PSRsave.bin - PSR output file, written after the successful completion of the entire simulation.
PSRInputStream - Spreadsheet format file which displays the composition of the reactor feed stream as a function of time.
PSRExcelresults - Spreadsheet format file which displays the composition in the PSR as a function of time.
SENKINtransfer.bin - PSR output file which serves as an input to the SENKIN module.

SENKIN Module:

SENKIN.inp - User-supplied input file containing sampling line parameter values.
SENKIN.out - SENKIN output file, containing the results of the sampling line computations.
SENrest.bin - SENKIN output file, containing the final solution for use as a starting point for further computations.
SENsave.bin - SENKIN output file, written after the successful completion of the entire problem.
SENKINExcelresults - Spreadsheet format file which displays the composition at the end of the sampling line as a function of time.
GRADIENT-DATA TOMOGRAPHY FOR HARTMANN SENSOR APPLICATION
TO  AERO-OPTICAL  FIELD  RECONSTRUCTION 


Soyoung  Stephen  Cha 
Associate  Professor 
Department  of  Mechanical  Engineering 
University  of  Illinois  at  Chicago 


Abstract 


Optical  tomography  based  on  the  Hartmann  sensor  can  be  a  very  useful  diagnostic  tool  for  aero-optics 
and  aero-dynamic  measurements.  It  has  strengths  in  remote  sensing  and  gross-field  capture  with  high 
data-acquisition  rate  for  fast  transient  phenomena.  For  expanding  its  practical  applicability,  however, 
some  problems  need  to  be  resolved.  First,  the  Hartmann  sensor  provides  field-gradient  projections. 
Conventional  tomographic  algorithms  require  field  projections  instead  of  gradients,  thus  requiring 
integration  of  gradient  data.  They  also  limit  the  application  scope  to  those  with  zero  boundaries  for 
definite  integration.  Second,  the  fields  to  be  reconstructed  have  been  assumed  to  be  square  or  circular. 
If the boundary is irregular, some reconstruction errors arise under these simplifying assumptions.
Accounting for the actual field shape can result in much better accuracy. To simplify the Hartmann sensor
optical setup or its application scope, we also need accurate reconstruction with fewer
projections, that is, reconstruction from limited data with restricted scanning and incomplete projections.
The  goal  proposed  here  is  twofold.  The  primary  objective  is  to  develop  tomographic  algorithms,  which 
can  utilize  both  integrated  and  gradient  data  for  arbitrary  field  shapes.  The  method  can  provide  us  with 
flexibility  and  allow  us  to  test  the  accuracy  of  two  modes  of  reconstruction.  The  secondary  goal  is  to 
develop  a  hybrid  technique  to  further  enhance  the  reconstruction  accuracy  by  incorporating  a  priori 
information.  In  these  efforts,  a  new  algorithm,  termed  Curvilinear  Nonlocal  Basis  Function  Method,  has 
been  developed  and  combined  with  the  Complementary  Field  Method  to  produce  a  hybrid  approach.  The 
combined method is tested especially for ill-posed conditions but only for integrated data, due to the
limited funding. The hybrid method performs better than a single method. The combined
method is effective in dealing with arbitrary boundaries and ill-posed problems of limited data, especially
for the Hartmann sensor.

1.  Introduction 

Reconstruction  of  refractive-index  fields  from  projection  data,  that  is,  optical 
tomography,  can  be  a  very  useful  diagnostic  tool  for  flow  field  imaging  especially  in  aero-optics 
or aerodynamics. It offers nonintrusive remote sensing, gross-field capture, high
measurement accuracy, and good spatial resolution compared with conventional techniques.
Currently,  the  aero-optics  research  group  of  the  Lasers  and  Imaging  Directorate  at  the  Air  Force 
Phillips  Laboratory  conducts  tomographic  reconstruction  for  imaging  three-dimensional  (3-D) 
flow fields. The optical tomographic system at the Phillips Laboratory consists of two parts:
the Hartmann-sensor hardware for obtaining projection data of test fields and the computational
software for reconstructing the fields from projections. Optical tomography based on the
Hartmann system provides an additional merit, namely rapid sampling of two-dimensional (2-D)
projections of a field. It can thus allow continuous real-time monitoring for reconstructing 3-D
fast  transient  phenomena.  For  expanding  the  practical  applicability  of  the  Hartmann  sensor, 
however,  some  problems  need  to  be  resolved. 

The  Hartmann  sensor  provides  integrated  gradient  data  of  refractive-index  along  an 
optical  ray  unlike  integrated  optical  pathlength  data  in  interferometric  tomography. 
Conventional  computational  tomographic  algorithms  require  projection  pathlength  data.  The 
present practice thus resorts to the additional step of integrating the gradient data to
obtain optical pathlength data. Hence, it is very desirable to develop a new approach that
can reconstruct refractive-index fields directly from gradient
projections. An approach based solely on integrated data may limit the application scope to
a special type of problem, i.e., free jets with zero projection values at the boundary. Second, the
fields to be reconstructed have been assumed to be square or circular. If the field boundary is
irregular, some reconstruction errors arise under these simplifying assumptions about the field shape.
Accounting for the actual field shape can result in much better accuracy, especially with
limited data points from restricted angular scanning and incomplete projections, a capability
that is much needed, as explained later. The proposed research goal is twofold.

•  The  primary  objective  of  the  proposed  research  is  to  develop  and  test  computational 
tomographic  algorithms  for  reconstructing  refractive-index  fields,  which  can  utilize 
both  integrated  and  gradient  data  obtained  by  Hartmann  sensors  for  arbitrary 
boundary  shapes.  The  method  can  provide  us  with  flexibility  and  allow  us  to  test  the 
accuracy  of  two  modes  of  reconstruction. 

Tomographic  hardware  collects  projection  information.  Hartmann  sensors,  being  based 
on  micro-fabrication  binary-optics,  can  provide  only  a  small  sensing  area.  Currently, 
experiments  at  the  Phillips  Laboratory  employ  a  parallel-beam  configuration  to  obtain  projection 
data. In that case, beam-collimation and collection lenses that are larger than the field to be
scanned are needed. For large-field scanning, this approach becomes impractical, resulting in a
bulky system that consists of many beam projection-collection pairs of comparable aperture, that
is, many large and expensive lenses. It is thus necessary to produce efficient hardware and
software to reconstruct fields from limited data points, restricted in scanning and incomplete in
projection. This allows a much simpler setup at a reduced cost in conducting
experiments.

• The secondary goal of the proposed research is to develop a hybrid technique that
we believe will further enhance the reconstruction accuracy by incorporating a priori
information.

In these efforts, a new reconstruction algorithm, termed the Curvilinear Nonlocal Basis
Function Method (CNBFM), has been developed and combined with the Complementary
Field Method (CFM) to produce a hybrid approach. The combined method is tested with
numerically generated fields, especially under ill-posed conditions, but only for integrated
data, due to the time limitation in the funding. Fully accomplishing the proposed goals
is a very extensive task and requires a
long-term commitment. A summary of the investigation, including approaches,
methodology, and analyses, is presented hereafter.

2.  Curvilinear  Nonlocal  Basis  Function  Method  (CNBFM) 

Mathematically,  any  2-D  field  f(x,y)  can  be  expressed  by  an  infinite  series.  Series 
expansion methods are suitable for ill-posed reconstruction. The bases in a series expansion can
be local or nonlocal. Some examples of local basis function methods are the fixed grid method,
cubic-spline method, variable grid method [1,2], etc. Nonlocal basis function methods include
the sine function method and orthogonal series method [3,4]. Practically all the previous efforts
addressed fields with regular boundaries, i.e., circular or rectangular, and without opaque
objects [5,6,7]. In the CNBFM, fields are represented by

$$ f(r,\phi) = \sum_{i=0}^{\infty} C_i S_i(r,\phi). \qquad (1) $$

Here S_i(r,φ) is the series term in polar coordinates and C_i is the corresponding coefficient. With
reasonable accuracy, the infinite series can be approximated by a finite number of terms,

$$ f(r,\phi) = \sum_{i=1}^{M} C_i S_i(r,\phi) \qquad (2) $$

where M is the number of basis functions. Various basis functions can be chosen for the series
expansion. Here Legendre polynomials in r and the harmonic functions in φ, i.e., sine and cosine
functions, were adopted as basis functions. Equation (2) then becomes

$$ f(r,\phi) \approx \sum_{m=0}^{M}\sum_{n=0}^{N} C_{mn}\,P_m(r)\,\Phi_n(\phi) = \sum_{m=0}^{M}\sum_{n=0}^{N} \left[ A_{mn}\,P_m(r)\cos(n\phi) + B_{mn}\,P_m(r)\sin(n\phi) \right] \qquad (3) $$

where Φ_n(φ) is a circular harmonic of order n, P_m(r) is the Legendre polynomial of order m, and M
and N are the highest orders of the polynomials and harmonic functions, respectively. A_mn and B_mn are the
coefficients to be determined for a field reconstruction. If these coefficients are known, the
distribution of the physical property of the field can be found. The primary goal of reconstruction
is thus to find the coefficients in Equation (3).

The  optical  pathlength  data  can  be  expressed  by 

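$$ g(p,\theta) = \int_{\mathrm{ray}} f(r,\phi)\,\delta(p - x\cos\theta - y\sin\theta)\,ds $$

$$ \approx \int_{\mathrm{ray}} \sum_{m=0}^{M}\sum_{n=0}^{N} \left[ A_{mn}\,P_m(r)\cos(n\phi) + B_{mn}\,P_m(r)\sin(n\phi) \right] ds $$

$$ = \sum_{m=0}^{M}\sum_{n=0}^{N} \left[ A_{mn}\int_{\mathrm{ray}} P_m(r)\cos(n\phi)\,ds + B_{mn}\int_{\mathrm{ray}} P_m(r)\sin(n\phi)\,ds \right] \qquad (4) $$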

The total number of unknowns in Equation (4) is M(2N-1). Depending on the view angle and the
number of data points in each projection, that is, the step sizes of p and θ in sampling, which are
denoted by Δp and Δθ, the total number of equations can vary significantly. In practice, the
number of equations can always be made greater than the number of unknowns, so that the system of linear
algebraic equations is over-determined. This redundancy is very important in calculating the
unknowns when higher accuracy is required. A convenient and effective method for solving
this redundant system is the least-squares method.

Assume that Equation (4) is expressed in the matrix notation AX=B, where A, X, and B
represent the Radon transform values of the individual series terms for the rays (p,θ), the series expansion
coefficients, and the measured data, respectively. In a system of linear equations, if there are errors
in some elements of the coefficient matrix A or the constant vector B in AX=B, large
distortions may arise in the estimated result for X. There are always some errors in
measurements of optical pathlength data. To reduce the effect of measurement errors and
ill-posed data collection as much as possible, suitable redundancy is necessary. There is a
trade-off in selecting the optimal values of (Δp,Δθ) and (M,N). These values differ for
individual problems.
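
The least-squares solution of an over-determined system AX=B can be illustrated with a minimal sketch. The matrix sizes, the random test matrix, and the noise level below are hypothetical choices made only for illustration; they are not taken from the study, and NumPy's `lstsq` stands in for whatever solver the original code used.

```python
import numpy as np

def solve_least_squares(A, b):
    """Return the least-squares solution X of the over-determined system A X = b."""
    X, _residuals, rank, _sv = np.linalg.lstsq(A, b, rcond=None)
    return X, rank

# Hypothetical toy system: 50 "rays" (equations) and 6 unknown coefficients.
rng = np.random.default_rng(0)
A = rng.normal(size=(50, 6))
x_true = np.array([0.7, -0.2, 0.05, 0.3, 0.0, -0.1])

b_clean = A @ x_true                               # error-free "measurements"
b_noisy = b_clean + 1e-3 * rng.normal(size=50)     # simulated measurement error

x_clean, rank = solve_least_squares(A, b_clean)
x_noisy, _ = solve_least_squares(A, b_noisy)

print("rank of A:", rank)
print("max coefficient error with noisy data:", np.max(np.abs(x_noisy - x_true)))
```

Because there are many more equations than unknowns, the noise in B is averaged over the redundant rows, which is the role of redundancy described above.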

When tomographic reconstruction is performed upon a field with an opaque object inside
the field, the polynomial P_m(r) in Equation (3) changes to

$$ P_m\!\left( 2\,\frac{r - R_{\min}}{R_{\max} - R_{\min}} - 1 \right), $$

where R_min and R_max are shown in Figure 1. This guarantees that the argument of P_m ranges over [-1,1], where the
orthogonal Legendre polynomials are defined.
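
As a small illustration of this change of argument, the sketch below evaluates P_m at the mapped radius using NumPy's Legendre routines. The function name `mapped_legendre` and the sample radii are assumptions introduced here, not part of the original code.

```python
import numpy as np
from numpy.polynomial import legendre

def mapped_legendre(m, r, r_min, r_max):
    """Evaluate P_m at the argument 2*(r - r_min)/(r_max - r_min) - 1."""
    t = 2.0 * (np.asarray(r, dtype=float) - r_min) / (r_max - r_min) - 1.0
    coeffs = np.zeros(m + 1)
    coeffs[m] = 1.0                      # select the single polynomial P_m
    return legendre.legval(t, coeffs)

# Hypothetical radii: the opaque object ends at 0.3, the field boundary is at 1.0.
print(mapped_legendre(1, 0.3, 0.3, 1.0))    # argument is -1 at r = R_min
print(mapped_legendre(1, 1.0, 0.3, 1.0))    # argument is +1 at r = R_max
print(mapped_legendre(2, 0.65, 0.3, 1.0))   # argument is 0 at the midpoint
```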

The  above  algorithm  implemented  into  a  computer  code  can  be  described  step  by  step  as 
follows: 

1. Assume a field f(r,φ), or in Cartesian coordinates f(x,y), the boundaries of the field, and
an opaque object, together with the sets of (Δp,Δθ) and (M,N).

2. Integrate f(x,y) along each probing ray (p,θ) to obtain g(p,θ).

3. Integrate the M(2N-1) basis functions along the same ray (p,θ) according to the right-hand
side of Equation (4).

4. Set up a system of linear algebraic equations for each ray (p,θ) based on Equation (4).

5. Apply the least-squares method to solve the over-determined system of equations obtained
from step 4 to find the coefficients A_mn and B_mn.

6. Reconstruct the field by substituting A_mn and B_mn back into Equation (3). Compare the
original values of f(x,y) with the reconstructed values at a set of sampling points by
finding the average and maximum errors.
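
The steps above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the original code: the field is the unit disk with no opaque object, rays are parallel, the line integrals use a midpoint rule, the test field is synthesized from the basis itself (so the data are free of model error), and all names (`basis_terms`, `ray_integrals`) are hypothetical. The sine term with n = 0 is identically zero and is left out, so this sketch has (M+1)(2N+1) unknowns.

```python
import numpy as np
from numpy.polynomial import legendre

M, N = 2, 2  # highest polynomial and harmonic orders (kept small for illustration)

def basis_terms(x, y):
    """Evaluate every basis function P_m(r)cos(n phi), P_m(r)sin(n phi) at (x, y).

    Returns an array of shape (number of basis functions, number of points).
    """
    r = np.hypot(x, y)
    phi = np.arctan2(y, x)
    terms = []
    for m in range(M + 1):
        pm = legendre.legval(r, [0.0] * m + [1.0])    # P_m(r)
        for n in range(N + 1):
            terms.append(pm * np.cos(n * phi))
            if n > 0:                                  # sin(0 * phi) is identically zero
                terms.append(pm * np.sin(n * phi))
    return np.array(terms)

def ray_integrals(p, theta, n_quad=400):
    """Integrate every basis function along the ray (p, theta) across the unit disk."""
    half = np.sqrt(1.0 - p * p)                        # half of the chord length
    ds = 2.0 * half / n_quad
    s = -half + (np.arange(n_quad) + 0.5) * ds         # midpoint-rule nodes
    x = p * np.cos(theta) - s * np.sin(theta)
    y = p * np.sin(theta) + s * np.cos(theta)
    return basis_terms(x, y).sum(axis=1) * ds

# Steps 1-4: choose the ray sampling and build the linear system A X = B.
thetas = np.deg2rad(np.arange(0.0, 180.0, 15.0))       # full 180-degree view
offsets = np.linspace(-0.95, 0.95, 21)
A = np.array([ray_integrals(p, t) for t in thetas for p in offsets])

rng = np.random.default_rng(1)
true_coeffs = rng.uniform(-1.0, 1.0, A.shape[1])       # synthetic field coefficients
B = A @ true_coeffs                                    # simulated projection data

# Step 5: least-squares solution of the over-determined system.
coeffs, *_ = np.linalg.lstsq(A, B, rcond=None)

# Step 6: compare original and reconstructed fields at sampling points.
xs, ys = rng.uniform(-0.6, 0.6, (2, 200))              # points inside the unit disk
f_true = true_coeffs @ basis_terms(xs, ys)
f_rec = coeffs @ basis_terms(xs, ys)
max_err = np.max(np.abs(f_true - f_rec)) / np.max(np.abs(f_true))
print("normalized maximum error:", max_err)
```

Restricting `thetas` to a smaller angular range, or masking rays that would cross an opaque object, gives a simple way to explore the ill-posed cases discussed in the text.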

2.1.  Computer  Simulation 

In  order  to  test  the  reconstruction  accuracy  by  the  algorithm  presented  in  Section  2,  we 
used computer-simulated data for refractive-index fields. Two different fields were tested: field
f_1(x,y) with a single hump and field f_2(x,y) with double humps. The mathematical expressions
for f_1(x,y) and f_2(x,y) are as follows:

$$ f_1(x,y) = 0.7\exp\left\{-5\left[(x-0.1)^2 + (y-0.2)^2\right]\right\} \qquad (5) $$

$$ f_2(x,y) = 0.8\exp\{\cdots\} + 0.6\exp\left\{-25\,\frac{(x+0.25)^2 + (y+0.35)^2}{2.1 - x^2 - y^2}\right\} \qquad (6) $$

An opaque object was selected in testing the reconstruction algorithm for incomplete projection
data, that is,

$$ 5(x+0.1)^2 + 4(y-0.1)^2 = 1 \qquad (7) $$
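For the purpose of comparison, we defined two types of errors as follows:

$$ \sigma_{\max} = \frac{\mathrm{Max}\left(\left|f(x_i,y_i) - f_r(x_i,y_i)\right|\right)}{\mathrm{Max}\left(\left|f(x_i,y_i)\right|\right)} \qquad (8) $$

$$ \sigma_{\mathrm{ave}} = \frac{\sum_{i=1}^{N_d} \left|f(x_i,y_i) - f_r(x_i,y_i)\right|}{N_d \cdot \mathrm{Max}\left(\left|f(x_i,y_i)\right|\right)} \qquad (9) $$

where σ_max and σ_ave denote the maximum error and the average error, respectively. N_d is the total
number of nodes (x_i,y_i) inside the field boundary but outside of the opaque object when an
opaque object is present.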
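
The two error measures translate directly into code. The sketch below assumes the original and reconstructed field values have already been sampled at the same N_d nodes; the function name and the sample numbers are hypothetical.

```python
import numpy as np

def reconstruction_errors(f_true, f_rec):
    """Return (sigma_max, sigma_ave), both normalized by the peak of |f_true|."""
    f_true = np.asarray(f_true, dtype=float)
    f_rec = np.asarray(f_rec, dtype=float)
    diff = np.abs(f_true - f_rec)
    scale = np.max(np.abs(f_true))
    sigma_max = diff.max() / scale
    sigma_ave = diff.sum() / (diff.size * scale)
    return sigma_max, sigma_ave

# Hypothetical values at four nodes.
sigma_max, sigma_ave = reconstruction_errors(
    [0.0, 0.5, 1.0, 0.25],
    [0.0, 0.45, 1.02, 0.25],
)
print(sigma_max, sigma_ave)
```

Multiplying both values by 100 gives percentages, as quoted in the tables.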

To investigate the behavior of the algorithms under incomplete projection, various view
angles were tested for each of the six different test cases, that is, the full angle of 180°, 160°, 120°, 90°,
60°, and 40°. For each test case, there were many combinations of M and N as the
highest orders of the polynomials and harmonic functions. Different combinations of (M,N) result in
different reconstruction accuracies. There must be an optimal (M,N) corresponding to the best
accuracy in each case. Tables 1 and 2 show the minimum values of the maximum
and average errors and the corresponding (M,N). Figures 2 and 3 graphically show the results.
For all the cases, the maximum and average errors decreased considerably as the view
angle  increased.  Some  plots  of  the  reconstructed  fields  are  presented  in  Figures  4  and  5.  Figure 
4  shows  the  reconstruction  result  for  the  one-hump  field  with  an  opaque  object  under  the  full 
view  angle.  There  was  no  visible  distortion  as  seen  in  the  plot.  Quantitatively,  the  maximum 
and  average  errors  were  as  low  as  0.086%  and  0.001%,  respectively.  When  the  view  angle 
becomes  the  lowest  value  of  40°  as  seen  in  Figure  5,  there  is  still  no  visible  distortion  of  the  field 
with  the  maximum  and  average  errors  of  0.39%  and  0.036%,  respectively.  For  the  cases  with 
two humps as shown in Figure 6, visible distortions appear with the small view angle of 40° in
the reconstructed fields, especially in the area blocked by the opaque object. This is a direct
effect of ill-posed data sampling.

2.2.  Conclusion 

A  new  tomographic  algorithm  has  been  developed  by  employing  a  series  expansion 
method  based  on  nonlocal  basis  functions,  termed  Curvilinear  Nonlocal  Basis  Function  Method 
(CNBFM). Unlike local basis function methods, it represents the entire field with an orthogonal
series.  By  finding  the  coefficients  of  all  the  basis  functions  with  the  measured  data,  the  field  can 
be  reconstructed.  Computer  simulation  results  show  that  the  CNBFM  provided  high 
reconstruction  accuracy  after  optimization  of  the  highest  order  of  polynomials  and  harmonic 
functions.  The  results  indicate  the  developed  algorithms  are  reliable  and  applicable  in 
reconstructing  3-D  flow  fields. 

3.  Complementary  Field  Method  (CFM) 

The  Complementary  Field  Method  (CFM)  is  iterative  and  was  developed  by  Sun  and  Cha 
[7, 8] as a unified approach for reconstructing fields under various ill-posed conditions. It
utilizes an estimate of the field as a priori information. The step-by-step procedure is as follows:

1. Assume an initial estimate f_e(x,y) of the original field f(x,y) to be reconstructed.

2. Calculate the projection data g_e(p,θ) of the field estimate f_e(x,y) only at the points where
the measured g(p,θ) are available:

$$ g_e(p,\theta) = L\{ f_e(x,y) \} $$

where L is the line integral transform operator.

3. Calculate the difference in the projection domain, which gives the projection data of the
complementary field f_c(x,y), that is, the difference between f(x,y) and f_e(x,y):

$$ g_c(p,\theta) = C_1\{ g(p,\theta) - g_e(p,\theta) \} $$

where C_1 is the constraint operator on the Radon transform of the complementary field.

4. Reconstruct the complementary field:

$$ f_c(x,y) = R\{ g_c(p,\theta) \} $$

where R is the reconstruction operator.

5. Improve the estimate of the original field by

$$ f_r(x,y) = f_e(x,y) + f_c(x,y) $$

Now physical constraints can be incorporated to make the estimate more meaningful:

$$ f_e^{\mathrm{new}}(x,y) = C_2\{ f_r(x,y) \} $$

where C_2 is the constraint operator on the reconstructed field.

6. Return to step 2 with the new estimate f_e^new(x,y) unless some suitable termination criteria
are satisfied.

Based on a priori information, the two important types of constraints C_1 and C_2 can be
formulated. Possible a priori information can be: (1) properties of the Radon transform of the original field,
such as positiveness of the line integrals or periodicity of the measured data with respect to the
projection angle; (2) physical configuration information of the field, such as the range of field
values and the locations of local maxima. Operators R and L in steps 2 and 4 may introduce
some inherent errors because they operate with approximate discrete numerical processing and a
finite number of ill-posed data. Practically all reconstruction algorithms produce a null field
from null projection data. This implies that the closer f_e(x,y) is to f(x,y), the nearer f_c(x,y) is
to a null field and the more accurate the reconstruction. This can be easily seen in the above
iterative steps. One advantage of the CFM is the ease of using a priori information. Even slight
knowledge of the field can be incorporated very easily into the procedure. In contrast to
conventional iteration methods, which work directly with the limited data, the CFM works with the
complementary field.
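
The iteration can be sketched with generic linear operators. In the toy example below, every modeling choice is an assumption made for illustration: a random matrix A plays the role of the line-integral operator L, its pseudo-inverse stands in for the reconstruction operator R, C_1 is taken as the identity, C_2 clips the field to a known range of values, and a fixed iteration count replaces a termination criterion. None of this is the authors' implementation.

```python
import numpy as np

def cfm_reconstruct(A, g, f_init, lower, upper, n_iter=50):
    """Complementary-field style iteration on a discretized field (toy version)."""
    A_pinv = np.linalg.pinv(A)              # stand-in for the reconstruction operator R
    f_e = f_init.copy()
    for _ in range(n_iter):
        g_e = A @ f_e                       # step 2: projections of the current estimate
        g_c = g - g_e                       # step 3: complementary projections (C_1 = identity)
        f_c = A_pinv @ g_c                  # step 4: reconstruct the complementary field
        f_r = f_e + f_c                     # step 5: improved estimate
        f_e = np.clip(f_r, lower, upper)    # C_2: enforce the known range of field values
    return f_e

rng = np.random.default_rng(2)
f_true = np.zeros(40)
f_true[10:18] = 0.7                         # hypothetical non-negative field on 40 cells
A = rng.normal(size=(15, 40))               # only 15 "rays": a severely ill-posed problem
g = A @ f_true                              # simulated measured projections

f_plain = np.linalg.pinv(A) @ g             # single reconstruction, no a priori information
f_cfm = cfm_reconstruct(A, g, np.zeros(40), lower=0.0, upper=0.7)

err_plain = np.linalg.norm(f_plain - f_true)
err_cfm = np.linalg.norm(f_cfm - f_true)
print("error without constraints:", err_plain)
print("error with CFM-style iteration:", err_cfm)
```

In this linear setting each pass is a projection onto the set of fields consistent with the data followed by a projection onto the range constraint, so the distance to the true field cannot increase from one pass to the next.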

3.1.  Computer  Simulation 

In the computer simulation of experiments, the results from the reconstruction were
compared with the known test fields, that is, the single-hump field f_1(x,y) and the double-hump
field f_2(x,y) of Section 2. In numerical simulations, the boundaries of the fields were
defined as

$$ 2x^2 + y^2 = 1 \qquad (10) $$

In defining the opaque objects, consideration was given to generating asymmetric configurations.
Two different opaque objects were selected to test the reconstruction algorithm, that is,

$$ \text{opaque object \#1:}\quad 4(x+0.1)^2 + 3(y-0.1)^2 = 1; \qquad (11) $$

$$ \text{opaque object \#2:}\quad 5(x+0.1)^2 + 4(y-0.1)^2 = 1. \qquad (12) $$

In  this  manner,  six  different  test  cases  were  generated  for  testing;  more  specifically,  one-hump 
field  and  two-hump  field  with  and  without  opaque  objects  #1  and  #2,  respectively.  To 
investigate  the  behavior  of  the  algorithms  under  incomplete  projection,  various  view  angles  were 
tested  for  each  of  the  six  different  test  cases:  that  is,  full  angle  180°,  160°,  120°,  90°,  60°,  and  40°, 
respectively. 

Figures  7  to  10  show  the  results  from  computer  simulation  of  experiments.  The  general 
trends  are  similar  to  the  cases  of  the  CNBFM.  For  all  the  cases,  the  maximum  and  average  errors 
decreased considerably with the increase of view angle. For the cases with two humps, visible
distortions appear with the small view angles of 40° and 60° in the reconstructed fields,
especially in the area blocked by the opaque object. This is a direct effect of ill-posed data
sampling. The reconstruction of the areas that were scanned by the probing beams, however, was
fairly reliable. Since the CFM was developed to deal with ill-posed problems and the
CNBFM showed good reconstruction accuracy under well-posed conditions, the CFM was
mainly tested for the reconstruction of ill-posed problems, i.e., with an opaque object in a field
and a view angle smaller than 90°. Tables 3, 4 and 5 show the maximum and average errors for
the  cases  of:  (1)  no  opaque  object  in  the  two-hump  field;  (2)  opaque  object  #1  in  the  two-hump 
field;  and  (3)  opaque  object  #2  in  the  two-hump  field,  respectively. 

It  can  be  clearly  seen  that  under  a  limited  view  angle  and  with  an  opaque  object  inside  the 
field, the CFM increased the accuracy in reconstruction. The relatively small
improvement when the CFM was incorporated is believed to be due to the good performance of
the CNBFM, which left little room for further improvement by the CFM.
Since  the  field  boundaries  and  opaque  object  shapes  were  specified  in  the  CNBFM,  it  practically 
utilized  a  priori  information.  This  can  be  one  of  the  reasons  why  the  CNBFM  provided  good 
reconstruction.  The  a  priori  information,  however,  could  have  been  incorporated  in  the  CFM. 

3.2.  Conclusion 

In  the  research,  a  few  different  computer-generated  fields  were  used  to  test  the 
performances  of  the  CNBFM  combined  with  the  CFM.  The  combination  of  the  CNBFM  and  the 
CFM  provided  satisfactory  accuracy  especially  under  severely  ill-posed  conditions,  i.e.,  limited 
view angle and/or an opaque object inside a field that blocked part of the probing rays. The
hybrid method apparently performs better than a single method. The results
show that the combined method can be effective in dealing with ill-posed conditions with limited
data, especially for the Hartmann sensor.
Figure 1. Definition of R_min and R_max.

Figure 2. Reconstruction errors: one-hump field with opaque object.

Figure 3. Reconstruction errors: two-hump field with opaque object.

Figure 4. Reconstruction result: one-hump field with no opaque object and the full view angle of 180°.

Figure 7. Reconstruction errors: one-hump field with no opaque object.

Figure 8. Reconstruction errors: one-hump field with opaque object #2.

Figure 9. Reconstruction errors: two-hump.

Figure 10. Reconstruction errors: one-hump.


TABLE 1. Reconstruction errors: one-hump field with opaque object #2.


TABLE 2. Reconstruction errors: two-hump field with opaque object #2.


TABLE 3. Reconstruction error comparison between the CNBFM and CNBFM+CFM: two-hump field with opaque object #1.
TABLE 4. Reconstruction error comparison between the CNBFM and CNBFM+CFM:
two-hump field with no opaque object.

              Max. error (%)                              Av. error (%)
View angle    CNBFM     CNBFM + CFM    Error decrease     CNBFM          CNBFM + CFM    Error decrease
90°           7.9X3     [illegible]    1.433              [illegible]    1.457          -0.118
60°           16.550    12.459         4.091              1.961          1.704          0.257
40°           27.635    22.162         5.473              3.427          2.975          0.452

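TABLE 5. Reconstruction error comparison between the CNBFM and CNBFM+CFM:
two-hump field with opaque object #2.

              Max. error (%)                              Av. error (%)
View angle    CNBFM     CNBFM + CFM    Error decrease     CNBFM          CNBFM + CFM    Error decrease
90°           6.107     5.050          1.057              0.414 [a]      [a]            -0.014
60°           [illegible]
40°           39.304    29.796         [illegible]        4.080          3.263          [illegible]

[a] Only one of the two average-error entries for 90° (0.414) is legible; its column is uncertain.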
Abstract

A neural network with time-delay elements has been shown to be a promising
approach to the prediction of optical distortion due to air flow perturbations.
Preliminary experiments accomplished during the AFOSR summer research program
have indicated that the neural network shows potential for extracting patterns and
could lead to new, effective methods for these predictions. If carefully
timed with appropriate feedback sensors, neural network output could potentially be
used to control an array of tiny mirrors to provide ongoing corrections to distortions
caused by turbulent air flows. In experiments performed at the Phillips Laboratory,
a jet flow of hot air was produced and measured at multiple downstream positions
to determine the amount of optical refraction produced at the interface between
the hot air flow and the surrounding cooler air ([MVF95], [MMC+95]). A series of
measurements was taken over time to capture the time-dynamic structure of the
air jet flow at varying distances from the flow nozzle. Preliminary experiments
were based on these data. In this report, we extend our work to include further
computations on air flow prediction, increased capabilities of the neural network
implementations for prediction, and consideration of a comparison between the
statistical method of linear stochastic estimation and neural networks. We also report
some intriguing properties of dynamic neural networks, which show promise for
future research but are not immediately ready for application. We describe work
currently in progress.
1 Overview and Significance


Dynamic  neural  networks  include  architectures  that  imbue  the  network  with  properties 
that  are  dynamic  over  time,  such  as  the  ability  to  model  and  predict  time  series  that  are 
complex  or  chaotic,  the  ability  to  evoke  oscillations  in  the  network,  and  the  capability  of 
training  the  network  on  oscillations,  including  complicated  oscillations  that  do  not  appear 
to  repeat  exactly  over  any  observed  length  of  time.  Networks  that  are  promising  for 
time  series  predictions  include  feed-forward  architectures  with  time-delay  elements,  such 
as  the  time-delay  neural  network  (TDNN)  and  the  adaptive  time-delay  neural  network 
(ATNN).  These  networks  adapt  weights  and  sometimes  time-delays,  and  have  proven 
highly  effective  in  being  trained  to  produce  limit  cycle  oscillations  and  chaotic  time  series 
([LDL94], [LDL95]).

We  have  researched  the  possibility  that  neural  networks  can  be  used  to  predict  optical 
wave front distortions in a flow field, where dynamically changing air flows can distort
optical  signals.  The  refraction  of  an  optical  signal  changes  dynamically  over  time  because 
of  the  continuous  motion  and  turbulence  in  a  warm  air  flow.  Whereas  individual  points  of 
laser  light  are  refracted  by  the  air  flow,  which  has  been  experimentally  measured,  entire 
images  would  also  be  distorted.  Our  long-term  goal  is  to  be  able  to  restore  an  optical 
image  at  a  high  resolution,  by  compensating  for  the  distortion  in  a  timely  fashion,  utilizing 
the  neural  network’s  prediction.  In  this  study,  we  are  predicting  refraction  of  an  individual 
laser  light  signal  with  a  neural  network,  so  that  the  neural  network  prediction  can  later 
be  used  to  compensate  for  the  refraction  and  to  stabilize  and  clarify  the  reception  of  the 
light  signal. 
An experiment was previously devised and implemented at the Phillips Laboratory to
measure the optical refraction of the boundary between a hot air flow and the surrounding
cooler air ([MMC+95]). Sensors were configured to detect the change in position of a laser
beam after it passes through a warm jet air flow. The sensor consisted of a lateral effects
detector (LED), which is a dual-axis photodiode that can measure the centroid location of
an incident laser beam. The LED works similarly to typical photodiodes in that it induces a
current proportional to the intensity of the beam. The current is divided among four strip
terminals, where current is measured, and the position of the beam is then calculated.

With  the  LED  sensors,  data  measurements  were  taken  at  varied  distances  and  from 
different  orientations.  Each  data  channel  reports  refraction  over  a  series  of  time  steps. 
Measurements  included  data  taken  at  different  distances  from  the  origin  of  the  jet  flow, 
in x and y directions, and under varying conditions of forced flow, non-forced flow, and
background. 

Preliminary  experiments  were  performed  in  which  a  neural  network  was  applied  to 
the  air  flow  data.  A  time  history  of  measurements  was  used  to  train  the  neural  network 
to predict the next measurement in the sequence. A series of preliminary computational
training runs was performed in which networks were trained on the first 500 data points
and tested on the next 1000 data points. High performance was attained, with the
neural network predictions usually explaining 80-93% of the variation. These results are
highly  promising  and  motivate  further  studies  on  neural  network  prediction  of  air  flow 
data. 

The  neural  networks  used  for  air  flow  prediction  were  feed-forward  configurations 
with three layers. Time-delay elements were included on both layers of weights, causing
inputs to the network to include a time history of data. In addition, time-delays on the
hidden-to-output  layer  allowed  the  output  layer  to  receive  a  time  history  of  hidden  unit 
activations.  Adaptive  training  was  performed  on  the  weights,  and,  in  some  cases,  on  the 
time-delays  as  well.  A  new  training  run  was  done  on  each  data  signal.  For  each  data 
signal,  the  network  configuration  was  left  the  same  but  a  different  set  of  initial  starting 
weights  was  used.  Training  converged  successfully  in  almost  all  cases  where  the  data 
was  collected  with  forced  or  non-forced  air.  For  background  measurements,  the  neural 
network  often  did  not  converge,  as  the  measurements  were  extremely  small.  In  a  few 
cases,  background  recordings  were  predicted  by  the  neural  network,  presumably  because 
of  ambient  perturbations  present  in  the  air  flow. 

This  SREP  project  extends  the  preliminary  experiments  to  research  more  situations 
appropriate  to  controlling  and  correcting  for  realistic  optical  distortions.  The  preliminary 
experiments  were  limited  in  the  length  of  data  analyzed,  the  amount  of  pre-processing,  and 
the  timing  of  data  measurements  used  for  analysis.  We  are  addressing  these  limitations. 
We  also  consider  here  a  comparison  between  traditional  statistical  methods  for  time  series 
prediction  and  the  neural  network.  Lastly,  we  explore  the  computational  abilities  of 
a  neural  network  that  is  similar  to  the  time-delay  neural  network  used  in  our  air  flow 
studies,  but  extends  its  dynamic  properties. 
Figure 1: Air jet flow apparatus. Reprinted with permission.

2  Methodology 

2.1  Wave  Front  Sensing 

Figure  1  shows  the  heated  air-jet  flow  generation  system.  A  blower  pumps  air  across  a 
heater  and  into  a  plenum  chamber,  which  becomes  a  source  for  heated  air  to  flow  upwards 
into  a  nozzle  to  form  a  jet  of  heated  air  from  a  2.5-cm  opening.  The  air-jet  flow  then  rises 
vertically,  and  a  laser  beam  is  passed  through  the  heated  air  as  it  moves  upwards.  The 
sensor  receives  the  laser  light  and  measures  the  focal  spot  intensity  on  a  plane,  placed 
perpendicular  to  the  laser  light  beam.  The  velocity  of  the  flow  center  is  4.5  m/s  on 
average,  and  the  average  temperature  of  the  air  in  the  jet  is  approximately  10  degrees  C 
above  ambient. 

The  distance  of  the  sensor  above  the  nozzle  can  be  varied,  and  a  spectrum  of  data  was 
taken  at  different  distances.  Measurements  were  taken  above  the  nozzle  and  perpendicular 
to  the  air  flow.  These  measurements  were  taken  in  the  x  and  y  directions  in  the  plane 
perpendicular to the air flow. The experimental conditions included (1) forced air flow,
(2) non-forced air flow, and (3) background. Thus, for each height above the nozzle, 6
data  streams  were  taken,  consisting  of  two  different  directions  under  each  of  the  three  air 
flow  conditions. 

The  sensor  was  a  lateral  effects  detector  (LED),  which  is  comprised  of  a  dual-axis 
photodiode.  The  LED  induces  a  current  from  which  the  centroid  location  of  an  incident 
laser  beam  can  be  calculated.  The  current  is  proportional  to  the  intensity  of  the  beam, 
and  is  divided  among  four  strip  terminals,  where  current  is  measured,  so  that  the  position 
of  the  beam  can  be  calculated  [Lun96]. 
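
As a rough illustration of how a centroid position can be recovered from the four terminal currents, the sketch below uses the normalized-difference relation commonly quoted for dual-axis lateral-effect photodiodes. The function name, the argument names, and the `half_length` parameter are assumptions made for this example; the processing actually used in the experiment is described in [Lun96] and may differ.

```python
def beam_position(i_x1, i_x2, i_y1, i_y2, half_length=1.0):
    """Estimate the beam centroid (x, y) from four terminal currents.

    Assumed model: the position along an axis is proportional to the
    difference of the two opposing terminal currents divided by their sum.
    half_length is a hypothetical half-width of the active area.
    """
    sum_x = i_x1 + i_x2
    sum_y = i_y1 + i_y2
    if sum_x <= 0 or sum_y <= 0:
        raise ValueError("total current on each axis must be positive")
    x = half_length * (i_x2 - i_x1) / sum_x
    y = half_length * (i_y2 - i_y1) / sum_y
    return x, y
```

Because each difference is divided by the total current on that axis, the estimate in this model does not change when the beam intensity is scaled, which is consistent with the current being proportional to intensity.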

2.2  Neural  Networks  with  Embedded  Time-Delays 

The time-delay neural network (TDNN) originally proposed by Waibel ([Wai89], [WHH+89],
[HW92]) employs time-delays on connections and has been successfully applied to phoneme
recognition, classification of spatiotemporal patterns ([LDL92c]), and missile
discrimination ([LDR95]). In the TDNN architecture, each neuron takes into account not only the
current  information  from  its  input  neurons  of  the  previous  layer,  but  also  a  certain  amount 
of  past  information  from  those  neurons  due  to  delays  on  interconnections.  Typically  the 
time  delays  are  evenly  spaced  over  a  time  interval  called  the  frame  window,  although 
arbitrary  time  delays  may  be  used.  Training  is  done  with  spatiotemporal  patterns,  one 
or  more  signal  channels  over  time,  and  the  classification  of  those  patterns  is  reported  at 
each  time  step  by  the  output  layer.  After  training,  the  weights  are  strengthened  along 
those  interconnections  whose  time  delays  are  important  to  recognition. 

The adaptive time-delay neural network (ATNN), which adapts time delays as well
as weights during training, is a more advanced version of the TDNN. The result is a
dynamic learning technique for spatiotemporal classification and for time series prediction
([LDL92a]). The ATNN model employs multiple interconnections between each pair of
neurons in a feed-forward configuration, with each interconnection having its own weight
and time-delay. Both time delays and weights are adjusted according to system dynamics
in an attempt to achieve the desired optimization. The adaptation of the delays and
weights is derived from the gradient descent method to minimize the error during
training. Processing units do not receive data through a fixed time window, but gather
important information from various time delays, which are adapted via the learning
procedure.

Figure 2: Delay block in a time-delay neural network.

The  schematic  architecture  of  the  connections  from  one  processing  unit  to  another 
processing  unit  of  the  ATNN  is  depicted  in  Figure  2.  The  configuration  of  multiple 
interconnections  between  a  single  pair  of  units,  each  with  its  own  delay,  is  called  a  delay 
block. Node i of layer h − 1 is connected to node j of the next layer h, with the connection
line k having an independent time delay $\tau_{jik,h-1}$ and synaptic weight $w_{jik,h-1}$.
Figure 3: Time-delay neural network depicted as built from delay blocks, with input, hidden, and output layers.

The  entire  network  is  constructed  by  the  delay  blocks  that  connect  neurons  layer  by 
layer,  as  illustrated  in  Figure  3.  It  is  not  necessary  to  have  the  same  number  of  delays 
for  different  units  in  the  same  layer  or  the  same  delay  values  from  different  units,  because 
the  computation  is  local  for  each  interconnection.  Each  connection  can  have  an  arbitrary 
delay  value  and  each  pair  of  neurons  can  have  any  number  of  delayed  interconnections. 

In the TDNN, the adaptation variables are the weights, and in the ATNN, the
adaptation variables are time-delays and weights. Each node sums the net inputs from the
activation values of the previous neurons, through the corresponding time delays on each
connection line; that is, at time $t_n$ (i.e., time step n), unit j on layer h receives a weighted
sum

$S_{j,h}(t_n) = \sum_{i} \sum_{k} w_{jik,h-1} \, a_{i,h-1}(t_n - \tau_{jik,h-1})$  (1)

where $a_{i,h-1}(t)$ is the activation of unit i in layer h − 1 at time t. Then the output of node
j is governed by a nondecreasing sigmoid function as follows

$f(S) = \frac{2}{1 + e^{-S}} - 1$  (2)

where S is the incoming sum, as in (1).
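
The weighted sum in (1) and the squashing step in (2) can be sketched for a single unit as follows. This is a minimal illustration, not the software used in the study: it assumes integer delays measured in time steps, treats activations before the start of the record as zero, and takes the sigmoid to be the bipolar form 2/(1 + e^(-S)) - 1. All function and argument names are hypothetical.

```python
import math


def sigmoid(s):
    """Bipolar sigmoid with range (-1, 1); the exact form is an assumption."""
    return 2.0 / (1.0 + math.exp(-s)) - 1.0


def delayed_sum(histories, weights, delays, n):
    """Weighted sum received by one unit at time step n.

    histories[i][t] is the activation of input unit i at step t.
    weights[i][k] and delays[i][k] describe connection k from unit i
    (integer delays, in time steps). Steps before 0 contribute nothing.
    """
    total = 0.0
    for i, history in enumerate(histories):
        for w, tau in zip(weights[i], delays[i]):
            t = n - tau
            if 0 <= t < len(history):
                total += w * history[t]
    return total


def unit_output(histories, weights, delays, n):
    """Activation of the unit: the sigmoid applied to the delayed sum."""
    return sigmoid(delayed_sum(histories, weights, delays, n))
```

With one input unit and two connections (delays of 0 and 1 step), `delayed_sum` combines the current sample and the previous sample, which is the delay-block arrangement described above.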

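The adaptation of the delays and weights is derived based on the gradient descent
method to minimize the error function E during training, where

$E(t_n) = \frac{1}{2} \sum_{j} \left( d_j(t_n) - a_{j,h}(t_n) \right)^2$  (3)

where the sum runs over the output units j and $d_j(t_n)$ is the target value for output unit j at time $t_n$.

The weights and time delays are updated step by step in proportion to the opposite
direction of the error gradient, respectively:

$\Delta w_{jik,h-1} = -\eta_1 \, \partial E(t_n) / \partial w_{jik,h-1}$  (4)

$\Delta \tau_{jik,h-1} = -\eta_2 \, \partial E(t_n) / \partial \tau_{jik,h-1}$  (5)

where $\eta_1$ and $\eta_2$ are the learning rates.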

The  derivation  of  this  algorithm  has  been  previously  addressed  ([DD91],  [DD93], 
[LDL92b],  [LDL92a]).  The  learning  rules  may  be  summarized  as  follows 

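$\Delta w_{jik,h-1} = \eta_1 \, \delta_{j,h}(t_n) \, a_{i,h-1}(t_n - \tau_{jik,h-1})$  (6)

$\Delta \tau_{jik,h-1} = -\eta_2 \, \delta_{j,h}(t_n) \, w_{jik,h-1} \, a'_{i,h-1}(t_n - \tau_{jik,h-1})$  (7)

where

$\delta_{j,h}(t_n) = \left( d_j(t_n) - a_{j,h}(t_n) \right) f'(S_{j,h}(t_n))$ if j is an output unit, and
$\delta_{j,h}(t_n) = \left( \sum_p \sum_q \delta_{p,h+1}(t_n) \, w_{pjq,h}(t_n) \right) f'(S_{j,h}(t_n))$ if j is a hidden unit.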

Characterization  and  application  of  the  ATNN  network  has  been  addressed  previously 
([LDL95]). 
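
To make the weight learning rule concrete, the sketch below applies one gradient-descent step to the weights of a single output unit. It is an illustration under stated assumptions rather than the authors' software: the sigmoid is taken to be the bipolar form 2/(1 + e^(-S)) - 1, whose derivative is (1 - f^2)/2, the delays are held fixed, and the inputs are assumed to be the already-delayed activations arriving over each connection. The function names are hypothetical.

```python
import math


def f(s):
    """Assumed bipolar sigmoid."""
    return 2.0 / (1.0 + math.exp(-s)) - 1.0


def f_prime(s):
    """Derivative of the assumed sigmoid, written in terms of its output."""
    y = f(s)
    return 0.5 * (1.0 - y * y)


def output_unit_step(inputs, weights, target, eta):
    """One gradient-descent step on the weights of a single output unit.

    inputs[k] is the delayed activation arriving over connection k and
    weights[k] is the matching weight. Returns the updated weights and
    the error E = 0.5 * (target - output)**2 measured before the update.
    """
    s = sum(w * a for w, a in zip(weights, inputs))
    out = f(s)
    delta = (target - out) * f_prime(s)  # delta for an output unit
    new_weights = [w + eta * delta * a for w, a in zip(weights, inputs)]
    return new_weights, 0.5 * (target - out) ** 2
```

Each weight moves by eta * delta * input, which is the negative of the error gradient with respect to that weight scaled by the learning rate, so repeating the step on the same pattern should reduce the error for a sufficiently small learning rate.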
2.3 Dynamic Neural Networks

Neural networks with time-delay elements can learn dynamic behavior by training to
produce complex oscillations or limit cycles. Previous research has demonstrated training of
TDNN and ATNN networks on a chaotic sequence ([LDL94]) and on limit cycle attractors
([LDL95]). The type of air flow data analyzed in this report has previously been shown to
contain similar dynamic fluctuations ([MVF95]). Thus the TDNN and ATNN approach
is highly promising. We have also examined an extension to this approach by
researching a single-layer recurrent network with random weights, which can be altered to
evoke chaotic behavior spontaneously without training. Specific patterns applied to this
network evoke specific oscillations. Thus, a pattern-to-oscillation map could be used for
recognition.

2.4  Preliminary  Results 

TDNN networks were trained on a series of recorded air jet flows. The
configuration used was one input unit, three hidden units, and one output unit. There were two
interconnections connecting each pair of neurons in a feed-forward direction. For each
pair, one interconnection had no time-delay (i.e., a time-delay of zero), and the other
interconnection had a time-delay of one time step. In our initial set of training runs,
reported here, the weights were trained and the time-delays were held fixed. The learning
rate parameter, η, was started at 0.2, its highest value, and then relaxed linearly to take
on lower and lower values over 20,000 iterations of training. The final learning rate value
was 0.08. Training was terminated automatically after 20,000 iterations.
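
The learning-rate schedule described above can be written as a short function. This is a sketch that assumes the rate is updated once per training iteration and interpolated linearly between the stated start and end values; the function and parameter names are hypothetical.

```python
def learning_rate(iteration, total_iterations=20000, start=0.2, end=0.08):
    """Linearly relax the learning rate from start to end.

    iteration runs from 0 (first update) to total_iterations (last update).
    """
    if not 0 <= iteration <= total_iterations:
        raise ValueError("iteration out of range")
    fraction = iteration / total_iterations
    return start + (end - start) * fraction
```

Under these assumptions the rate is 0.2 at the first iteration, about 0.14 halfway through, and 0.08 at iteration 20,000.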
The data sets were prepared from more than fifty recordings of air jet flows, taken at
varying distances from the flow nozzle. Distances from 0.5 to 8.0 nozzle
diameters were covered. Conditions of air jet flow included forced air flow, non-forced air
flow, and background noise recordings. Optical distortion was measured
in an x-direction and a y-direction, both perpendicular to the flow of the air.

Each  data  recording  constituted  one  time  series  to  be  learned  and  predicted  by  one 
neural  network.  The  neural  network  was  initialized,  by  generating  small  random  weights, 
before  training  began  on  each  time  series.  The  first  500  data  measurements  were  used 
as  the  training  set,  and  the  next  1000  measurements  were  a  testing  performance  set. 
A  verification  set  was  not  needed  to  provide  a  stopping  criterion  for  the  neural  network, 
because  the  network  training  was  terminated  after  20,000  iterations  automatically.  Future 
experiments  are  planned  in  which  a  verification  set  will  be  used  for  a  stopping  criterion, 
to possibly boost performance. Training was quite successful nevertheless because the
relaxation of the η parameter effectively stopped training at relatively favorable times.

Table 1 summarizes the performance attained by the trained neural networks. The
performance reported is a computation that reflects the percent variation in the data that
is explained by network predictions. First a ratio is computed. The numerator of the ratio
is the root-mean-square of the network's error compared to its target values, as follows.

$R = \sqrt{(1/n) \sum_k \left( d(t_k) - a(t_k) \right)^2}$  (8)

where $d(t_k)$ is the target value at time $t_k$, and $a(t_k)$ is the output value at time $t_k$. The
denominator of the ratio is the standard deviation of the target values (i.e., the data
measurements).

$S = \sqrt{(1/n) \sum_k \left( d(t_k) - d_{mean} \right)^2}$  (9)

where $d_{mean}$ is the average target (measurement) value. The ratio is then R/S, and the
value F = 1 − R/S reflects the fraction of variation in the data that the network explained
through its predictions.

Table 1: Performances of trained neural networks.
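
The performance measure F = 1 − R/S can be computed directly from the target and output sequences. The sketch below is a minimal illustration; the function name is hypothetical and the error handling is an addition for this example.

```python
import math


def explained_fraction(targets, outputs):
    """Return F = 1 - R/S for equal-length target and output sequences.

    R is the root-mean-square prediction error and S is the standard
    deviation of the targets (both normalized by n).
    """
    n = len(targets)
    if n == 0 or n != len(outputs):
        raise ValueError("sequences must be non-empty and of equal length")
    mean = sum(targets) / n
    rms_error = math.sqrt(sum((d - a) ** 2 for d, a in zip(targets, outputs)) / n)
    std = math.sqrt(sum((d - mean) ** 2 for d in targets) / n)
    if std == 0:
        raise ValueError("targets have zero variance")
    return 1.0 - rms_error / std
```

A perfect prediction gives F = 1, and always predicting the mean of the targets gives F = 0. Since F is built from a ratio of root-mean-square quantities, it is not the same number as a coefficient of determination, which uses the squared quantities.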

Performance varied between 72% and 93% for forced and non-forced air experiments.
The background measurements were typically of much smaller magnitude compared with
the air flow experimental data. For most of the background trials, the neural network did
not converge to be able to predict the noise. However, the background tends to oscillate
irregularly, and in some cases the neural network converged to a prediction of a substantial
portion of the measurement variations.

Figure 4: Optical distortion due to hot air flow, forced air measured at 8.0 diameters from
nozzle. (a) Measurements. (b) Prediction by trained neural network. 1500 data points in
each graph. Orientation: x direction.

Figure 4 shows results for forced air flow measurements taken at 8.0 diameters from the
flow nozzle. Figure 4(a) shows the first 1500 measurements taken, from the x direction
orientation, and Figure 4(b) shows the predictions for those measurements provided by
the trained neural network. The first 500 of the time steps shown were the training set,
and the remainder were the performance testing set. The prediction follows the data
extremely well, with the exception that at some of the peaks the prediction does not
reach the magnitude of the data. Figure 5 is analogous but for the y direction orientation.

Figure 5: Optical distortion due to hot air flow, forced air measured at 8.0 diameters from
nozzle. (a) Measurements. (b) Prediction by trained neural network. 1500 data points in
each graph. Orientation: y direction.

Figure 6 shows results for non-forced air flow measurements taken at 8.0 diameters
from the flow nozzle. Figure 6(a) shows the first 1500 measurements taken, from the
x direction orientation, and Figure 6(b) shows the predictions for those measurements
provided by the trained neural network. The first 500 of the time steps shown were the
training set, and the remainder were the performance testing set. The prediction follows
the data extremely well, with the exception that at some of the peaks the prediction
does not reach the magnitude of the data. Figure 7 is analogous but for the y direction
orientation.

Figure 6: Optical distortion due to hot air flow, non-forced air measured at 8.0 diameters
from nozzle. (a) Measurements. (b) Prediction by trained neural network. 1500 data
points in each graph. Orientation: x direction.

Figure 7: Optical distortion due to hot air flow, non-forced air measured at 8.0 diameters
from nozzle. (a) Measurements. (b) Prediction by trained neural network. 1500 data
points in each graph. Orientation: y direction.
3 Research Results

The primary focus of this research is to predict air flow distortions in as realistic a fashion
as possible. To this end, we wish to extend our experiments to deal with new
considerations that will arise when distortions are corrected by an array of tiny mirrors (MEM).
Extension of previous experiments is thus underway, and includes the use of larger data
sets and varied timing schedules, a comparison of the performance of the neural networks
to other statistical methods, and additional research on alternative dynamic network
architectures and their computational capabilities.

Initial neural network runs were performed on data with 500 successive points.
Experiments on longer data sets are underway, to test the performance observed in the
preliminary  runs.  The  longer  data  sets  consist  of  up  to  45,000  points.  Each  data  set  will 
be  divided  into  a  training  set,  a  test  set,  and  a  verification  set.  The  test  set  will  be  used 
to  decide  when  to  stop  training  the  neural  network,  and  the  verification  set  will  be  used 
for  a  performance  report. 

The  rate  of  data-taking  is  10  kiloHertz.  Thus  the  45,000  data  points  take  4.5  seconds. 
The  ultimate  application  for  this  work  is  in  the  control  of  an  array  of  tiny  mirrors.  We 
estimate  that  the  mirrors  can  be  adjusted  at  a  rate  of  about  2  kiloHertz,  or  once  for  every 
5  data  points  measured.  Thus  a  time-series  prediction  scheme  must  be  devised  and  tested 
so  that  the  maximum  rate  of  data  is  exploited,  but  there  can  be  no  new  data  used  after 
the  command  to  adjust  a  mirror  is  issued.  Thus,  the  time  series  prediction  should  be 
tested  at  5  steps  ahead. 

We have completed software that provides pre-processing of the data to set up
experiments with n-step-ahead predictions. The user can choose how many steps ahead to
train the neural network to predict. Previous experiments were limited to one-step-ahead
prediction. Additional runs have been done at a 5-step-ahead interval, and performance
was still surprisingly good. For example, a training run was done in which trained
networks explained 81% of the variance in the training set and 74% of the variance in the
performance set. The data for this run were taken at 8 diameters from the nozzle, the
longest distance and thus likely the hardest to predict.
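
The pre-processing for n-step-ahead prediction can be pictured as pairing a short window of recent samples with the sample that lies n steps in the future. The sketch below is illustrative only and is not the project's software; the function name, the window length `n_delays`, and the oldest-first ordering of the window are assumptions.

```python
def make_pairs(series, n_ahead=5, n_delays=2):
    """Build (input window, target) pairs for n-step-ahead prediction.

    Each input holds the n_delays most recent samples up to time t
    (oldest first); the target is the sample at time t + n_ahead, so no
    data measured after time t is used to form the input.
    """
    if n_ahead < 1 or n_delays < 1:
        raise ValueError("n_ahead and n_delays must be at least 1")
    pairs = []
    for t in range(n_delays - 1, len(series) - n_ahead):
        window = list(series[t - n_delays + 1 : t + 1])
        pairs.append((window, series[t + n_ahead]))
    return pairs
```

With data sampled at 10 kHz and mirrors adjusted at about 2 kHz, `n_ahead=5` corresponds to predicting one mirror-update interval ahead. Setting `n_ahead=1` gives the one-step-ahead arrangement of the earlier experiments.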

Other  timing  considerations  can  impact  on  how  a  neural  network  should  be  trained 
to  fit  into  a  controller  structure  for  an  array  of  tiny  mirrors.  For  example,  it  may  be 
beneficial  to  have  sparse  sampling  of  the  data  for  n-step  ahead  predictions,  where  n  >  1. 
For  example,  a  network  could  be  trained  on  data  that  is  sampled  every  5  time  steps  if  a 
5-step  ahead  prediction  is  to  be  done.  We  aim  to  perform  experiments  to  find  out  what 
is  the  best  timing  of  the  data  to  be  used  in  training  the  needed  predictors.  The  adaptive 
time-delay  neural  network  (ATNN)  developed  in  software  by  the  P.I.  is  an  auspicious 
technique  for  using  past  data  samples  in  prediction.  The  ATNN  allows  time  delays  to 
be  incorporated  in  the  neural  network,  so  that  past  values  of  the  time  series  data  can 
be  used  during  prediction.  Furthermore,  the  ATNN  automatically  evaluates  whether  the 
particular  time  delays  for  the  past  values  are  the  best,  or  whether  the  time  delays  should 
be  shifted  to  optimize  performance. 

New  software  has  been  developed  to  consider  the  difference  between  two  different 
components  of  the  prediction  -  the  value  of  the  previous  data  sample,  and  the  difference 
between  the  previous  data  sample  and  the  next  data  sample.  Thus,  we  can  test  the  neural 
network's predictive power of the two components separately. When the neural network
was trained to predict the next air flow distortion value in a time series of data, the
network first captured the value of the previous data sample in its prediction of the next
data  sample.  Since  the  previous  data  sample  may  be  available  in  a  real-life  situation, 
we  can  subtract  that  away  and  train  the  neural  network  on  the  difference  between  the 
previous  sample  and  the  upcoming  sample.  Then,  the  neural  network’s  predictive  power 
will  be  most  challenged.  Our  new  software  reports  the  predictive  power  of  using  the 
previous  data  sample  as  the  prediction,  and  allows  us  to  compare  this  to  the  neural 
network’s  prediction.  Additional  pre-processing  software  prepares  the  data  so  that  the 
neural  network  can  be  trained  on  the  difference  between  previous  and  upcoming  values 
alone. 
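
The two ideas in this paragraph, scoring the previous sample as a prediction and training on differences, can be sketched as follows. This is a hypothetical illustration, not the software described above; it scores predictions with the F = 1 − R/S measure defined earlier in this report, and all function names are assumptions.

```python
import math


def explained_fraction(targets, outputs):
    """F = 1 - R/S: one minus RMS error over target standard deviation."""
    n = len(targets)
    mean = sum(targets) / n
    rms_error = math.sqrt(sum((d - a) ** 2 for d, a in zip(targets, outputs)) / n)
    std = math.sqrt(sum((d - mean) ** 2 for d in targets) / n)
    return 1.0 - rms_error / std


def persistence_score(series):
    """Score obtained by predicting each sample with the previous one."""
    return explained_fraction(series[1:], series[:-1])


def difference_targets(series):
    """Targets for training on the change between successive samples."""
    return [b - a for a, b in zip(series, series[1:])]
```

Comparing `persistence_score` with the network's own score shows how much of the network's performance goes beyond simply repeating the last sample, and `difference_targets` yields the series a network would be trained on when that component is subtracted away.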

Additional experiments that include different training criteria and schedules are
desirable and will be done in the future if time permits. These experiments would include
variations  in  performance  and  training  criteria,  such  as  training  to  minimize  the  absolute 
error,  varying  the  stopping  criteria,  varying  the  lengths  of  the  training  and  verification 
sets,  and  varying  the  neural  network  configurations. 

3.1 Comparison of neural networks to traditional statistical methods for time series prediction.

It is important to compare the performance of the neural network with traditional
statistical methods for time series. Our aim is to find the best method(s) for prediction rather
than to insist that a particular pre-specified approach be followed. Ideally, the best time
series prediction method that is not a neural network would be used in the comparison,
and then its performance could be benchmarked against a neural network approach.

Other  investigations  have  explored  the  approach  of  linear  stochastic  estimation  (LSE) 
to  air  flow  predictions.  This  technique  relies  on  several  additional  measurements  being 
done  in  the  air  flow  simultaneously  to  the  prediction  that  is  performed.  Results  have 
been  reported  to  be  very  good  ([Lun96],  [LTM97],  [TMP+98]).  It  is  important  to  note 
that  this  technique  relies  on  additional  information  that  was  not  available  to  the  neural 
networks  trained  in  this  project.  The  additional  data  consists  of  several  time  series  of 
temperature  measurements  in  the  turbulent  air  flow.  A  new  experiment  could  be  tailored 
to  compare  the  neural  network  more  directly  to  the  LSE  approach  by  giving  the  neural 
network  the  same  additional  temperature  measurements.  We  could  then  test  whether 
the  neural  network  can  increase  the  performance  of  the  current  LSE  method,  or  whether 
adding  a  neural  network  to  the  LSE  method  would  increase  performance. 
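
For orientation, a linear estimate of one signal from several simultaneously measured channels can be sketched as a least-squares fit. The code below is a generic illustration of that idea and should not be read as the LSE implementation of [Lun96], [LTM97], or [TMP+98]: it assumes zero-lag, first-order terms only, and the channel data and function names are hypothetical.

```python
def solve_linear(matrix, rhs):
    """Solve a small linear system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular correlation matrix")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    solution = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * solution[c] for c in range(r + 1, n))
        solution[r] = (aug[r][n] - tail) / aug[r][r]
    return solution


def fit_linear_estimator(channels, target):
    """Coefficients A minimizing the mean square error of sum_i A[i]*channels[i]."""
    m = len(target)
    corr = [[sum(a * b for a, b in zip(ci, cj)) / m for cj in channels]
            for ci in channels]
    cross = [sum(a * y for a, y in zip(ci, target)) / m for ci in channels]
    return solve_linear(corr, cross)


def estimate(channels, coeffs):
    """Apply the fitted coefficients to the channel data, sample by sample."""
    return [sum(c * ch[t] for c, ch in zip(coeffs, channels))
            for t in range(len(channels[0]))]
```

The coefficients come from the correlations among the measured channels and between each channel and the target, so the estimate needs those extra channels at prediction time, which is the additional information noted above.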

In a realistic situation, however, these additional temperature measurements
may not be obtainable. Then, the neural network could be the best technique for
prediction  because  it  does  not  require  additional  data  channels  before  a  prediction  can  be 
made.  However,  in  a  real  application,  any  data  that  is  possible  to  obtain  should  be  made 
available  to  the  neural  network. 

3.2 Alternative dynamic network architectures and their computational capabilities.

Neural networks with time-delay elements are used in this study to do time series
prediction. Previously, these networks have been shown to learn dynamic behavior by training
to produce complex oscillations or limit cycles. Training of TDNN and ATNN networks
has  been  demonstrated  on  a  chaotic  sequence  ([LDL94])  and  on  limit  cycle  attractors 
([LDL95]).  The  type  of  air  flow  data  analyzed  in  this  report  has  previously  been  shown  to 
contain  similar  dynamic  fluctuations  ([MVF95]).  Thus  the  TDNN  and  ATNN  approach 
is  highly  promising. 

We  would  like  to  explore  the  possibility  that  other  architectures  for  dynamic  neural 
networks  would  be  promising  for  selected  applications.  We  are  currently  researching  a 
single-layer  recurrent  network  with  random  weights  which  can  be  altered  to  evoke  chaotic 
behavior  spontaneously  without  training.  We  are  considering  the  problem  of  how  to  utilize 
the  attractors  (oscillations)  in  this  dynamic  neural  network  to  perform  pattern  recognition 
and  other  computational  tasks  ([PD95]). 

The network is set up as follows. A random network is constructed with sparse
interconnections in a single layer of neurons. A multiplier g can be applied to all of the
weights  at  the  same  time  and,  when  g  is  increased,  chaotic  behavior  occurs.  An  external 
pattern  is  then  applied  to  the  network  as  a  fixed  bias,  and  a  pattern-to-oscillation  map 
can  be  constructed.  Figure  8  illustrates  an  oscillation  from  a  64-neuron  network  after  an 
external  pattern  is  applied.  The  network  is  chaotic  in  the  absence  of  the  external  pattern. 

The neural units are simple biologically-inspired nodes, performing a weighted sum
followed by a nonlinear squashing function f:

a_j(t+1) = f\left( \sum_{i} g \, w_{ji} \, a_i(t) \right) + \alpha E_j    (10)

where g in (11) is the multiplier for all weights in the network, and the weights are
randomly assigned with variation 1/k, where k is the number of randomly selected inputs
to each node. E is the fixed external pattern applied as a bias to the network, and α
is the strength with which the external pattern is applied. This network has previously
been  studied  and  theoretically  considered  ([DCQS93],  [DCQS95],  [QDS95],  [CDQS94], 
[SCS88]). 

The P.I. has developed a method for evoking a pattern-to-oscillation map for
this network, where the oscillations are constrained to be either a finite-state (n-state)
oscillation or a limit cycle. First, the network is run with no external pattern (E = 0),
and g is increased to produce chaotic behavior. Then an external pattern E is applied.
Transients are passed by iterating the network through (11) above. If the evoked attractor
satisfies the oscillation criterion, then the evoked oscillation is the result of the
pattern-to-oscillation map for pattern E. If the evoked attractor is too chaotic, then the
strength α must be increased for pattern E. If the evoked attractor is a fixed point, then the
strength α must be decreased to produce more complex dynamics. Through an algorithm for making
successive adjustments to α, the pattern-to-oscillation map can be constructed uniquely
for all patterns, given the initial chaotic network. This algorithm has been proposed by
the P.I. and constitutes an advance in the development of new dynamic neural networks.
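
The adjustment procedure can be sketched as follows. This is only an illustration under several assumptions that are not in the report: the update rule is read as a_j(t+1) = f(sum_i g w_ji a_i(t)) + α E_j, the squashing function f is taken to be tanh, the weights are drawn from a normal distribution with variance 1/k, chaos is detected by sensitivity to a tiny change in the initial state, and α is changed by a fixed factor at each step. All function names and thresholds are hypothetical.

```python
import numpy as np

def make_network(n=64, k=16, seed=0):
    """Sparse random weights: each unit receives k randomly chosen inputs."""
    rng = np.random.default_rng(seed)
    w = np.zeros((n, n))
    for j in range(n):
        inputs = rng.choice(n, size=k, replace=False)
        # Normal weights with variance 1/k (an assumed reading of the text).
        w[j, inputs] = rng.normal(0.0, np.sqrt(1.0 / k), size=k)
    return w

def run(w, g, alpha, e, a0, skip=1000, keep=1000):
    """Iterate the update rule, discard transients, return the mean activation per step."""
    a = a0.copy()
    means = np.empty(keep)
    for t in range(skip + keep):
        a = np.tanh(g * (w @ a)) + alpha * e  # tanh is an assumed squashing function
        if t >= skip:
            means[t - skip] = a.mean()
    return means

def classify(w, g, alpha, e, a0, tol=1e-6):
    """Crude, assumed criterion returning 'fixed', 'chaotic' or 'oscillation'."""
    base = run(w, g, alpha, e, a0)
    if np.ptp(base) < tol:
        return "fixed"
    nudged = run(w, g, alpha, e, a0 + 1e-8)
    # Sensitive dependence on the initial state is taken as the sign of chaos.
    return "chaotic" if np.max(np.abs(base - nudged)) > 1e-3 else "oscillation"

def tune_alpha(w, g, e, a0, alpha=1.0, factor=1.1, max_iter=50):
    """Raise alpha when chaotic, lower it at a fixed point, stop on an oscillation."""
    label = classify(w, g, alpha, e, a0)
    for _ in range(max_iter):
        if label == "oscillation":
            break
        alpha = alpha * factor if label == "chaotic" else alpha / factor
        label = classify(w, g, alpha, e, a0)
    return alpha, label

w = make_network()
pattern = np.random.default_rng(1).uniform(-1.0, 1.0, 64)
start = np.zeros(64)
```

A call such as `tune_alpha(w, g, pattern, start)` with g set just above the onset of chaos would then return the adjusted strength and the type of attractor reached. A fixed multiplicative step can overshoot between the chaotic and fixed-point regimes, so a bisection on α may be a better choice in practice.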

4  Impact  and  Future  Directions 

A set of preliminary runs was performed on neural network modeling of turbulent air
flow phenomena. Neural networks with time-delay elements were trained to predict the
amount of optical distortion caused by the flow of warm air out of a nozzle, in an
environment where the air is cooler and optical distortion arises from refraction at the boundary
between warm and cooler air. Neural network experiments show that the networks
converged to attain considerable predictive power.

Figure 8. Map of a neural network oscillation. The network had 64 neural units, with 16
randomly assigned inputs to each unit. (a) Plot of average activation a(t) versus a(t+1)
at successive time steps. (b) The same points are plotted as in (a), but successive points
are connected to illustrate that the limit cycle is comprised of a dense set of points along a
closed curve. One thousand transient steps were skipped before the pattern was applied,
then another thousand transient steps were skipped after the pattern was applied. The
graphs show the next thousand points.

Closer  examination  of  the  results  indicated  that  some  traditional  statistical  time  series 
methods  may  perform  well  on  the  predictive  task  attempted  by  the  neural  networks.  LSE 
methods  performed  well,  but  required  additional  data  to  be  taken  during  the  experiment. 

The  ultimate  aim  of  this  type  of  research  is  to  use  neural  networks  and  time  series 
predictions  to  control  an  array  of  tiny  mirrors  that  will  correct  for  optical  distortion 
from  the  atmosphere.  A  key  part  of  such  a  controller  is  a  model  that  predicts  upcoming 
distortion  values,  so  that  a  command  can  be  issued  to  individual  mirrors  to  compensate 
for  the  movement  of  optical  signals  across  the  grid,  due  to  air  distortion. 

We  also  extend  our  research  on  dynamic  neural  networks.  The  neural  networks  used 
in this study contain time-delay elements and can be trained on dynamically changing
data.  They  adapt  time-delays  to  optimize  the  information  used  from  the  past  for  the 
neural  network  prediction. 

Alternative  architectures  for  dynamic  neural  networks  show  flexibility  in  a  pattern- 
to-oscillation  map  and  will  have  increased  capacity  compared  to  static  neural  networks. 
These results provide insight into the potential computational capabilities of dynamic
neural networks.
Appendix


Figures  9-13  show  some  example  calculations  resulting  from  the  dynamic 
network  described  in  Section  3.2. 


Progression from fixed point to chaos in a random (64,16) network. The horizontal axis is average
activation at time t + 1, a(t + 1), and the vertical axis is average activation at time t, a(t). (a) g = 0.9 (b)
g = 1.0 (c) g = 1.1 (d) g = 1.2 (e) g = 1.3 (f) g = 1.4 (g) g = 1.5 (h) g = 1.6 (i) g = 1.7


Figure  9 
A chaotic network has 18 different patterns applied, with different results on the dynamics. (a)
Activity of the chaotic network, before an external pattern is applied. Multiplier g is set just above the
value where chaotic activity occurs. (Here, g = ....) (b-s) Activity of the network after 18 different patterns
were applied. Evoked dynamics is highly unique. Most graphs show recognizable low-order dynamics - limit
cycles and n-state oscillations - but some show chaotic (irregular) behavior and others show fixed points only.
Graphs have the horizontal axis as average activation at time t + 1, a(t + 1), and the vertical axis as average
activation at time t, a(t). The 18 patterns were generated from a uniform random distribution [-1, 1].
Figure 11


A chaotic network has different external patterns applied, each at strength α = 1.6. (a) The base
pattern, with 64 entries, each from a uniform random distribution (-1, 1). (b-k) The base pattern with 5%
noise added, to make 10 different variations of the pattern.


Figure  13 

The base pattern (a) and its ten variations (b-k). Here the pattern strength α was 1.8, an increase
compared with Figure 14. The variation between the evoked attractors is less compared with Figure 14,
where α was 1.6.
Figure 14

The base pattern (a) and its ten variations (b-k). Here the pattern strength α was 1.2, a large
decrease compared with Figure 14. The variation between the evoked attractors is considerably higher
compared with Figure 14, where α was 1.6.
COMPUTATIONAL EVALUATION OF OPTICAL SENSORS


Ronald  R.  DeLyser 

Associate  Professor  of  Electrical  Engineering 
Department  of  Engineering 
University  of  Denver 

Abstract 

The Temporal Electromagnetic Analysis Code (TEMAC3D) and the Temporal Scattering and Response
(TSAR) code, two Finite Difference Time Domain (FDTD) codes, and Ansoft μWaveLab, a Finite Element
Method (FEM) code, are compared using models of the Celestron-8 telescope and a satellite optical sensor.
Each code has attractive features which are suited to large, open cavity problems. They also have
requirements that limit the type and amount of output data or increase the computational expense of solving these
types of problems. This report will show the relative merits and disadvantages of these software packages.

The TSAR FDTD solver was installed at the University of Denver on an IBM RS6000; however, the rest
of the TSAR suite, consisting of Anastasia (the mesh generator), Image (the mesh viewer) and all of the
post processing packages, was not installed due to incompatibility of the installation package with the IBM
RS6000, and subsequent lack of support from Lawrence Livermore National Laboratory (LLNL),


Introduction 

This report documents a continuation of other projects reported in [1], [2] and [3]. It is also complementary
to  another  study  funded  by  Phillips  Laboratory  and  reported  in  [4].  The  objective  of  those  studies  was 
to  determine  the  electromagnetic  response  of  a  Celestron-8  telescope  and  a  satellite  optical  sensor  using 
numerical  analyses.  The  analyses  used  the  Method  of  Moments  (MoM),  the  Finite  Element  Method  (FEM), 
and  the  Finite  Difference  Time  Domain  (FDTD)  method.  The  programs  used  were  CARLOS-3D  [5]  (MoM), 
the  Hewlett  Packard  High  Frequency  Structure  Simulator  (HFSS),  a  commercially  available  FEM  program, 
and the Temporal Electromagnetic Analysis Code (TEMAC3D) [6], an FDTD code. The inherent problems
with CARLOS-3D [3] [7] and the computational expense of HP HFSS [3] [8] suggested that the FDTD
method [9] may be more suited to these studies. TEMAC3D, a FORTRAN program that is based on
the three dimensional implementation of the FDTD method used for open problems (structures that are
surrounded with open space), was used to analyze models of the Celestron-8 telescope and the optical
satellite sensor.

From the experience gained solving large problems with all of these codes, some of the conclusions that
were reported in [1] and in [10] were: (1) The FEM in general is not an efficient method for evaluating
frequency response of large complex cavities. It can, however, be used to verify testing and other
calculations for radiation problems at specified frequencies. Still, large computational resources are necessary for
HP HFSS. Models of reasonable fidelity are limited to relatively low frequencies. (2) Another commercially
available FEM package was identified, Ansoft Corporation’s μWaveLab, which can take advantage of
symmetry, has the capability of using second order elements, seeded meshing and can do the radiation problem
with symmetry. (3) An FDTD code with the capabilities of perfect magnetic and perfect electric boundaries
would allow the user to specify ground planes or apply symmetry to problems, thus reducing computational
requirements. This capability would also allow the modeling of the sensor on a ground plane.

While modifications of TEMAC3D to achieve the capabilities of the desired FDTD code were beyond
the scope of the present study, evaluation and minor modifications of the Temporal Scattering and Response
(TSAR) code, another FDTD program for open problems from Lawrence Livermore National Laboratory
(LLNL), were done. The capabilities that TSAR had beyond those contained in TEMAC3D were
implementation of perfect magnetic and perfect electric boundaries. Results of scattering problems using these
capabilities in TSAR are reported here and compared to the TEMAC3D results. Unfortunately, there were
problems with TSAR for radiation simulations so that no results can be presented.

Installation  of  Software  at  The  University  of  Denver 

Prior to doing the analysis of the Celestron-8 and the optical sensor, certain pieces of software were
identified for installation at The University of Denver. These were BRL-CAD, the TSAR software suite (the
TSAR FDTD solver, Anastasia and Image) and Ansoft’s μWaveLab.

Problems with installation of BRL-CAD and the TSAR suite on the AIX based IBM RS6000s arose
quickly. Installation of BRL-CAD was eventually accomplished by Bob Strausser of the SURVICE
Engineering Company. The installation of the TSAR suite was abandoned because of installation problems and
the  lack  of  support  from  LLNL.  The  personnel  who  were  once  at  LLNL  and  who  created  and  supported 
TSAR  are  no  longer  there.  Therefore,  a  modified  version  of  the  TSAR  FDTD  solver  was  acquired  from 
Ernie  Baca  and  Jeff  MacGillvory  at  Sandia  Labs.  The  installation  of  that  version  was  successful.  However, 
use  of  Anastasia  (the  mesh  generator)  and  Image  (the  mesh  viewer)  on  a  Sun  workstation  located  at  Phillips 
Laboratory  must  continue. 

There were no problems installing or running μWaveLab. It was leased from Ansoft with allocated funds
from this contract. This purchase also made it possible to switch from HP HFSS to Ansoft HFSS at no cost.
In the future all of the features of μWaveLab will be incorporated into Ansoft HFSS.

Software  Capabilities 

TEMAC3D requires a suite of peripheral software to generate solid models, generate the Finite
Difference (FD) mesh, view the mesh for consistency, generate the problem namelist and header files, and view the
results of the analysis. The solid model geometry file is generated by BRL-CAD, available from Ballistics
Research Laboratory, Aberdeen Proving Ground, MD. The geometry file is then input to ANASTASIA which
generates a mesh given the number of “pad” cells¹ and the size of the cell. IMAGE is used to view the mesh
to be sure that the FD mesh is really the desired mesh. This mesh file is then an input for XTEAR
(X-window Temporal Electromagnetic Analysis and Response) [6] which is a graphical user interface designed
for input of all relevant parameters for the FDTD simulations using TEMAC3D. Finally, software is needed
to view output fields as a function of time, frequency and/or position, to generate far-field scattering and/or
radiation plots, and to do Fast Fourier Transforms (FFTs) in order to determine the frequency response.
FORTRAN programs were provided by Dr. John Beggs for far zone processing and radiation plots.

¹Pad cells occupy the space between the outermost cells of the modeled object and the outermost cells of the problem space
which implement the absorbing boundary conditions.
A Gaussian pulse with plane wave excitation is used for scattering simulations of the sensor and the
Celestron-8. Point sensors are located inside the cavity to determine the time domain response for a time
believed to be long enough to reach steady state. Mathcad is then used to FFT the data to the frequency
domain.  Presentation  of  the  data  can  be  in  the  form  of  graphs  taken  directly  from  Mathcad  or  the  data  can 
be  further  processed  using  MathSoft  Axum. 
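
The time-to-frequency step can be illustrated with a short NumPy sketch in place of Mathcad. The sample interval, record length and the synthetic decaying 1.02 GHz ring-down below are assumed values chosen only to make the example self-contained; they are not data from the simulations.

```python
import numpy as np

def frequency_response(signal, dt):
    """Return (frequencies in Hz, normalized magnitude spectrum) of a real time series."""
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=dt)
    magnitude = np.abs(spectrum)
    return freqs, magnitude / magnitude.max()

# Hypothetical stand-in for a point-sensor record: a decaying 1.02 GHz ring-down.
dt = 1.0e-11                   # assumed 10 ps sample interval
t = np.arange(20000) * dt      # 200 ns record, giving 5 MHz frequency resolution
record = np.exp(-t / 30e-9) * np.sin(2 * np.pi * 1.02e9 * t)

freqs, mag = frequency_response(record, dt)
peak_ghz = freqs[np.argmax(mag)] / 1e9
```

The frequency resolution is 1/(N dt), which is one reason the time-domain run has to be long enough for the fields to decay.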

For radiation simulations, the excitation is a point source with a ramped sinusoidal signal. Equivalent
surface currents are calculated at a surface surrounding the radiator. These equivalent surface currents are
then written to a file for post processing by another program called “patterns”. With patterns, desired
cut planes are specified either in elevation or azimuth. For this study, angular sweeps are specified in the
θ direction (angle measured from the z-axis - elevation) at constant values of φ (angle measured from the
x-axis - azimuth). If another cut plane is needed, “patterns” is run again using the surface current data
to  generate  the  far  field  pattern  on  the  specified  cut  plane.  If  the  far  field  pattern  for  another  frequency  is 
needed,  TEMAC3D  must  be  run  again  with  the  ramped  sinusoid  at  the  new  frequency.  The  radiation  data 
is  plotted  using  Axum. 

Fields along slices internal to the cavity are used to determine resonance modes. Field and/or current
slice  sensors  can  be  placed  in  any  plane.  Any  total  or  scattered  field  or  current  quantity  can  be  defined  for 
the  slice  sensor.  The  only  disadvantage  to  this  capability  is  that  some  “intuitive  guessing”  is  required  for 
placement  and  type  of  sensors. 

The TSAR electromagnetic modeling system consists of a family of related codes which have been
designed to work together to provide the user with a practical way to set up, run, and interpret the results
from a 3-D electromagnetic simulation. The various codes in the TSAR system have been in development at
LLNL since 1987 and in active use since 1988. TSAR uses FDTD to solve Maxwell’s equations in 3-D on a
rectangular, Cartesian grid. As with TEMAC3D, BRL-CAD is used to generate the solid model; Anastasia
and Image are used to create and view the mesh; the FDTD solver is used to do the analysis. A number of
post-processing tools are also available. An interactive signal processing code developed at LLNL is used to
plot field values at a point vs. time. The package allows data to be scaled, shifted and Fourier transformed.
Contour and surface plotters based on the commercial graphics package CA-DISSPLA can be used to plot
a slice of data through the 3-D simulation volume in either a contour or surface format. These codes are
primarily used to provide snapshots in time of field patterns. There is also a group of workstation-based
codes that can be used to interactively examine a time sequence of 2-D field data and to plot this data on
videotape. Unfortunately, none of these post-processing codes could be evaluated because of the problems
encountered during installation of the LLNL release of TSAR.

The differences in capabilities between TEMAC3D and TSAR are few but significant. TSAR has first
and second order Mur absorbing boundaries (TEMAC3D uses second order Liao) and is also capable of
using perfect electric and perfect magnetic boundaries. The same procedure for the scattering problem that
was used for TEMAC3D is used for TSAR. For the radiation problem, the input can be any of the excitations
used for scattering problems, which include a ramped sinusoid. Use of the time domain excitations allows the
user to specify a number of frequencies. A virtual surface is defined over which equivalent
currents are calculated. These equivalent surface currents are then used as sources for calculated time domain
near and far fields at specified locations or Fast Fourier Transformed far fields to produce frequency domain
responses for cut planes or points. All of these calculations are done within the TSAR code itself. Outputs
for radiation problems are in three forms: (1) frequency domain far field φ and θ components of the electric
fields at specific points, (2) time domain near field φ and θ components of the electric fields at specific points,
and (3) frequency domain far field φ and θ components of the electric fields at specific points designated by
a sweep through specified values of φ at a constant value of θ.

Radiation plots are usually generated for aperture antennas with the normal vector of the aperture in
the z direction. Angular sweeps of the field quantities or the power radiated are then done in the θ direction
at constant values of φ. This inconsistency with what TSAR does was overcome by reprogramming the
relevant subroutines so that the patterns produced are a function of θ at constant values of φ. However,
the outputs for a particular run are specified far field points in the time domain, or radiation patterns for
specified frequencies. If a different pattern or far field point is needed, the problem has to be run again. To
plot the radiation data, a separate commercial graphing program is necessary. Unfortunately, this version
of TSAR had difficulties with the radiation problem so those simulations gave incorrect or no results.

As  it  is  for  TEMAC3D,  slice  sensors  can  be  placed  in  any  plane.  Any  total  or  scattered  field  or  current 
quantity  can  be  defined  for  the  slice  sensor.  The  same  “intuitive  guess  work”  as  for  TEMAC3D  must  be 
accomplished  for  placement  and  selection  of  type  of  these  slice  sensors. 

μWaveLab is a comprehensive, full-wave, 3D electromagnetic simulation and analysis system that
addresses arbitrary structures providing all field information. A comprehensive set of 3-D elements is available
in linear and quadratic accuracy. Fast, efficient, absorbing boundaries emulate infinity to accommodate
unbounded models. Solid modeling, auto-meshing, and push-button calculations are provided.

With the time domain codes, generating a frequency response to an external stimulus identifies frequencies
of interest for further investigation by solving the radiation problem. With μWaveLab, the capability of doing
the scattering problem with an incoming plane wave does not exist. Instead, the radiation problem is done
by exciting the structure as an antenna and the port scattering parameter, S11, is calculated at a number of
frequencies. This gives us the desired response needed to identify frequencies of interest.
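
One simple way to pick frequencies of interest from such a sweep is to look for dips in |S11|, where the structure accepts power from the port. The sketch below is an illustration only: the function name, the threshold of 0.7 and the synthetic two-resonance sweep are assumptions, not output from μWaveLab.

```python
import numpy as np

def resonance_candidates(freqs_hz, s11, max_mag=0.7):
    """Return the sweep frequencies at which |S11| has a local minimum below max_mag."""
    f = np.asarray(freqs_hz, dtype=float)
    mag = np.abs(np.asarray(s11))
    # Interior points that are lower than both of their neighbours.
    is_min = (mag[1:-1] < mag[:-2]) & (mag[1:-1] < mag[2:])
    keep = is_min & (mag[1:-1] < max_mag)
    return f[1:-1][keep]

# Synthetic sweep with two Lorentzian dips (hypothetical data).
sweep = np.linspace(3.0e9, 7.0e9, 401)   # 10 MHz steps

def dip(f0, width=20e6):
    return 1.0 / (1.0 + ((sweep - f0) / width) ** 2)

s11_mag = 1.0 - 0.6 * dip(3.58e9) - 0.5 * dip(6.47e9)
found = resonance_candidates(sweep, s11_mag)
```

Each frequency returned would then be rerun with the radiation data production turned on.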

All radiation data at all angles for all frequencies of the frequency sweep can be generated. This takes
an enormous amount of time (compared to turning off the radiation data production) and generates an
enormous amount of data, most of which is not of interest. A two step process is therefore used: frequencies
are identified via S11, and the problem is then run at the frequencies of interest with the radiation data
production on. Post processing produces radiation data at any angle of interest for each of the frequencies
selected for the analysis. Since μWaveLab is a frequency domain simulator, a run for each frequency is
necessary. The plots of the radiation pattern can be done within μWaveLab. Since there is no easy way to
format the data for commercial graphing programs, the plots are captured with a screen capture utility and
saved for inclusion in reports. Field and current quantities on surfaces that are defined during the geometric
modeling are stored and can be viewed with the post processing capabilities of μWaveLab. As with the codes
presented above, a certain amount of “engineering intuition” is needed for placement of these surfaces.

TEMAC3D Baseline Results²

Larger models for the Celestron-8 and the optical sensor than were reported in [1] and [2] were generated
and simulated using TEMAC3D. These, along with radiation calculations for the optical sensor reported in
[1] [2] and measurements on the Celestron-8 reported in [3], were the baseline results with which the results
from the other simulators are compared. A drawing of the Celestron-8 is shown in Figure 1. A large model
of the Celestron-8 which incorporates the loss in the Plexiglass window (dielectric constant, ε_r = 2.6 and
conductivity, σ = 0.002 S/m) and includes the glass (ε_r = 4.25 and σ = 0.002 S/m) behind the primary
mirror was created. The resulting mesh is shown in Figure 2. The scattering problem was done with an
x-polarized, Gaussian shaped pulse plane wave incident on boresight. Field point sensors are located at
various points inside the telescope model. Figure 3 shows a comparison of the normalized magnitudes of the
electric fields at the eyepiece location, the probe location, and the measured data reported in [3]. In the
low frequency range, the measured data track the probe location data fairly well, even though the measured
data are taken at the location of the eyepiece. The Phillips Laboratory testers believe that this can be explained
by poor isolation between the large cavity region and the location of the test sensor at the eyepiece. The
resonances at 1.02 GHz, 1.45 GHz, 1.61 GHz and 1.76 GHz were investigated with an x directed point source
at the probe location specified for the measurements reported in [3]. The radiation patterns for 1.02 GHz
and 1.45 GHz in the xz (φ = 0°) and yz (φ = 90°) planes are shown in Figure 4.

²Results that correspond to μWaveLab and TSAR results are presented here. Other results are reported in [4].

Figure 3: Comparisons of numerical and measured results.

The model³ of the sensor was generated with cell sides of 0.66 mm (see Figure 5). The scattering problem
was done with a Gaussian pulse in the form of a y-polarized plane wave incident on boresight. Field sensors
were placed at five different locations in the sensor model. The time domain information for all of these
locations showed the characteristic exponential decay indicative of a valid simulation. The frequency domain
plots for the detector and cavity center locations are shown in Figures 6 and 7. The results from this large
model compare favorably with the results reported in [1] for the smaller model.

Radiation results for frequencies of 3.58 GHz and 6.47 GHz are reported in [1] and [10]. Since the
smaller model of that report was of sufficient detail for accurate results, those results are reproduced here
(Figures 8 and 9) for completeness and comparisons with results from the other software packages. Even
though the model of the sensor was analyzed and reported in [1], the fields internal to the sensor were not
recorded. Figures 10 and 11 show field magnitudes for slice sensors in the xy plane in the center of the large
cavity.

³Details of the evolution of this model are given in [1].
Figure 5: Cross-section of the large model mesh for the sensor.

Figure 6: Normalized total electric fields for the detector location.

Figure 7: Normalized total electric fields for the cavity center location.

Comparisons of the transverse fields in the center of the cavity can be made with the mode chart of [11].
At 3.58 GHz the TE11 mode is evident for both polarizations. At 6.47 GHz, TE01 is the clear choice for
y polarization; however, the x polarization shows no distinct mode. A closed cavity with the dimensions of
the large central cavity region has resonance frequencies for the TE111 and TE011 modes of 3.576 GHz and
6.431 GHz respectively. Considering the fact that the large cylindrical cavity region of the sensor is far from
being a closed right circular cylinder, these calculated frequencies are remarkably close to those found in the
FDTD analysis.
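
For reference, closed-cavity frequencies of this kind follow from the standard textbook expression for the TE modes of a closed right circular cylindrical cavity, sketched below. The symbols a (cavity radius) and d (cavity length) are introduced here for illustration; the report does not list the cavity dimensions in this section, so the quoted values of 3.576 GHz and 6.431 GHz are not rederived.

```latex
% Resonant frequency of the TE_{nml} mode of a closed circular cylindrical cavity
% of radius a and length d, filled with a medium (\mu_r, \epsilon_r):
f_{nml} = \frac{c}{2\pi\sqrt{\mu_r \epsilon_r}}
          \sqrt{\left(\frac{p'_{nm}}{a}\right)^{2} + \left(\frac{l\pi}{d}\right)^{2}}
% p'_{nm} is the m-th root of J_n'(x) = 0:
% p'_{11} \approx 1.841 (TE_{111}), \quad p'_{01} \approx 3.832 (TE_{011})
```

The third index l counts the half wavelengths along the cavity length, which is why the field plots along the cavity axis can be used to assign it.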

Figures  12  -  15"*  show  electric  field  distributions  in  the  xz  plane  (plane  of  the  source),  and  in  the  yz  plane. 
Comparing  these  figures  shows  that  the  field  in  the  outer  toroidal  cavities  are  much  more  intense  for  the  x 
polarized  source  than  for  the  y  polarized  source  for  all  frequencies  except  6.47  GHz  and  in  both  xz  and  yz 
planes.  This  may  be  explained  by  observing  that  Figure  5  shows  the  mesh  for  the  xz  plane.  An  x  polarized 
source  at  the  detector  location  should  produce  greater  z-directed  fields  because  of  reflection  from  the  lower 
v-shaped  filter  area,  whereas  a  y-polarized  source  would  not.  These  z-dirccted  fields  easily  propagate  in  the 
entrances  to  the  outer  toroidal  sections. 

Another notable feature of these field distributions is that, for all but 6.47 GHz, the distributions show
nulls at the lens, indicating the relatively high conductivity and dielectric constant, and thus high reflectivity,
of this material. This information can be used to determine the third mode number, the number of half
wavelengths along the length of the cavity, for the resonance frequencies. The assumption of TE111 and TE011
modes at 3.58 GHz and 6.47 GHz (y polarization only) seems correct since there is only one half wavelength
variation in the fields for these resonances.

TSAR  Results 

The first use of TSAR produced numerous runs on a full (no use of symmetry or a ground plane) model of
the Celestron-8. The only difference between the TSAR model and the TEMAC3D model is that 7 pad cells
were used for TSAR with its 2nd order Mur radiation boundaries, as opposed to 5 pad cells for TEMAC3D
with its 2nd order Liao radiation boundaries. Samples of the output x-directed fields at the eyepiece
location and at the probe location are shown in Figures 16 and 17. These are the same positions as for
the TEMAC3D fields. The exponentially increasing DC component at the probe location continues to grow
for longer run times and is not physically reasonable. Use of 10 pad cells, replacement of the glass below
the primary mirror with air, use of perfect conductors as opposed to copper conductors, implementation of

⁴The scale for the magnitude of these fields has been set to emphasize the field variations at locations away from the source.
As such, the fields in locations near the source have been clipped at the maximum scale reading.
Figure 14: Electric field in the yz plane for x and y polarizations - 3.58 GHz.


Figure 15: Electric field in the yz plane for x and y polarizations - 6.47 GHz.
longer pulse times for the Gaussian pulses, and specifying a smaller Courant number⁵ of 1/3 produced little
if any change in this erroneous output.
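
For reference, the effect of the Courant number on the FDTD time step can be sketched as follows. This is a minimal illustration that assumes the Courant number is defined as S = c·Δt/Δx on a grid of cubic cells, for which the usual 3-D stability limit is S ≤ 1/√3. The 1 mm cell size and the function names are assumptions made for the example, not values taken from the TSAR or TEMAC3D input files.

```python
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def fdtd_time_step(cell_size_m, courant_number):
    """Time step for a cubic-cell FDTD grid: dt = S * dx / c."""
    return courant_number * cell_size_m / C0


def is_stable_3d(courant_number):
    """Courant stability condition for a 3-D cubic grid: S <= 1/sqrt(3)."""
    return courant_number <= 1.0 / math.sqrt(3.0) + 1e-12


if __name__ == "__main__":
    dx = 1.0e-3  # assumed cell size (1 mm), for illustration only
    cases = [("1/sqrt(3)", 1 / math.sqrt(3)), ("1/2", 0.5), ("1/3", 1 / 3)]
    for label, s in cases:
        dt = fdtd_time_step(dx, s)
        print(f"S = {label}: dt = {dt * 1e12:.3f} ps, stable = {is_stable_3d(s)}")
```

All three values satisfy the stability limit, so a smaller Courant number only shortens the time step; this is consistent with the observation that reducing it did not remove the growing DC component.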

These  problems  continued  with  a  full  model  of  the  sensor.  In  addition  to  the  variations  cited  above,  use 
of  a  point  radiator  as  opposed  to  a  plane  wave  source  yielded  similar  results  -  exponentially  increasing  DC 
components  of  certain  field  components.  The  decision  was  made  to  create  models  of  the  Celestron-8  and  the 
sensor  that  took  advantage  of  symmetry  and,  in  the  case  of  the  sensor,  add  a  ground  plane  at  the  base  of 
the  sensor  to  simulate  the  large  body  of  the  satellite.  Anastasia  was  used  to  mesh  1/2,  1/4  or  1/8  of  the 
full  geometric  model  depending  on  the  symmetry  desired.  Appropriate  boundary  conditions  (perfect  electric 
planes,  perfect  magnetic  planes  and  absorbing  boundaries)  are  then  defined  in  the  namelist  file,  “tsarin”. 
For  the  scattering  problem,  plane  waves  are  defined  as  for  the  fully  meshed  models.  These  simulations 
were  successful  and  are  reported  below.  It  appears  that  for  these  particular  scattering  problems  using  the 
Celestron-8  and  sensor  models,  TSAR  does  not  do  well  with  absorbing  boundary  conditions  defined  on  all 
of  the  outer  surfaces  of  the  computational  space. 

The symmetric model of the Celestron-8 was excited with an x polarized plane wave incident on the
window of the telescope. This is similar to the TEMAC3D model reported in [1] except for the symmetry
planes. For the x polarized electric field excitation, the xz plane (the plane containing the electric field vector)
is defined as a perfect magnetic conductor. The yz plane is then defined as a perfect electric conductor. The
response for this model is shown, along with measured results from [3], in Figure 18. It is similar to the
TEMAC3D results (Figure 3) but is not exactly the same. This is to be expected because, in addition to the
use of symmetry and a ground plane for the TSAR simulation, TEMAC3D uses 2nd order Liao absorbing
boundaries and TSAR uses 2nd order Mur absorbing boundaries. These results show that TSAR is working
well with symmetric models. Since this is the main reason for using TSAR in the first place, TSAR can be
used with confidence in future work where symmetry can be applied.
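
The choice of symmetry walls described above follows a simple rule: a plane that contains the incident electric field vector is a perfect magnetic conductor (PMC), and a plane normal to it is a perfect electric conductor (PEC). The sketch below encodes only that rule. The function name and the string labels are hypothetical, and the function does not check that the plane is actually a geometric symmetry plane of the model.

```python
def symmetry_wall(e_axis, plane):
    """Return 'PMC' or 'PEC' for a symmetry plane, given the axis of the
    incident electric field.

    e_axis: 'x', 'y' or 'z', the polarization direction of E.
    plane:  'xy', 'xz' or 'yz', the candidate symmetry plane.
    """
    if e_axis not in ("x", "y", "z"):
        raise ValueError("e_axis must be 'x', 'y' or 'z'")
    if plane not in ("xy", "xz", "yz"):
        raise ValueError("plane must be 'xy', 'xz' or 'yz'")
    # E tangential to the plane -> tangential H vanishes there (PMC).
    # E normal to the plane     -> tangential E vanishes there (PEC).
    return "PMC" if e_axis in plane else "PEC"


if __name__ == "__main__":
    for plane in ("xz", "yz"):
        print(f"x polarized E, {plane} plane: {symmetry_wall('x', plane)}")
```

For the x polarized excitation this gives PMC on the xz plane and PEC on the yz plane, matching the assignment used for the symmetric Celestron-8 model.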

Unfortunately, there were no valid radiation simulation results. TSAR was modified so that it would
produce files for specific values of φ for φ and θ polarized electric fields. The information in each file would
be electric field as a function of θ. The code was run using a ground plane and one plane of symmetry,
resulting in core dumps with no error codes generated. The pattern files were not written to disk and the
far field files⁶ were filled with NaNQ, which indicates an overflow or underflow. The analysis was run again using
the original unmodified subroutines with the same results. A sine wave excitation was then used along with

⁵TSAR normally uses a Courant number of 1/2 whereas TEMAC3D uses 1/√3. This is due to the differing radiation
boundary conditions.

⁶These files contain time domain information on the electric fields at specified angles in the far field.
Figure 18: Plane wave response of the Celestron-8.

defining “slice sensors”. In addition to the same results described above, these files contained field values of
“0” throughout. To date, there is no solution for this problem.

The 1 mm mesh model of the sensor that was used for TEMAC3D was also used for TSAR with the
exception that symmetry and 8 pad cells were used with TSAR. One symmetric model for TSAR defines
two symmetry planes, one electric and the other magnetic, so that only 1/4 of the full model is meshed.
Results for the thermopile detector location for this TSAR model are shown in Figure 19. The results are
again similar to the TEMAC3D results shown in Figure 6. Of course, the presence or absence of the ground
plane should not significantly affect the results for the scattering problem. The results are similar to the
TEMAC3D results (Figure 7) as expected.

A similar symmetric model with a ground plane located at the bottom of the sensor was also created.
Figure 20 shows the response at the center of the large cavity region of the sensor for the TSAR symmetric
model. The results for the TSAR symmetric model with the ground plane are almost identical to these.

In all of the TSAR results given thus far, the models give zero field intensity for the x and z components
at the locations noted. Since these locations are on a “perfect electric conductor” plane, this would naturally
be the case. The presumption is then that the full model used for the TEMAC3D simulation does not have
perfect symmetry and that small field intensities in the x and z directions are generated.

Figure 19: Plane wave response at the detector location of the Sensor.
Figure 20: Response in the large cavity region of the Sensor without ground plane

μWaveLab Results

Methods of running μWaveLab to best utilize the software in a timely manner were explored. The manual
solution option in μWaveLab allows the analysis to be run outside the graphical user interface. This increases
the user’s time efficiency in two ways. It allows the user to work on a completely different model while the
analysis is running in the background, and it also permits batch processing so that one analysis may follow
on the heels of another when the user is unavailable to begin the next analysis. In order to run a solution
manually, “manual analysis” must be selected on the analysis menu and a file name must be assigned when
prompted. That file can then be run at any time. Batch processing may be achieved by writing UNIX script
files containing each file to be analyzed.
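
The batch idea can be sketched in a few lines. The report used UNIX shell scripts; the Python version below is a substitute chosen only for illustration, and it makes no assumption about how a saved μWaveLab analysis file is actually invoked. Each job is simply a command line supplied by the user, and the placeholder jobs here call the Python interpreter itself.

```python
import subprocess
import sys


def run_batch(commands):
    """Run each command in order and return a list of (command, return code).

    A failing job does not stop the queue, so later analyses still run.
    """
    results = []
    for command in commands:
        completed = subprocess.run(command)
        results.append((command, completed.returncode))
    return results


if __name__ == "__main__":
    # Placeholder jobs: in practice each entry would be the command line
    # that launches one saved analysis file (hypothetical here).
    jobs = [
        [sys.executable, "-c", "print('analysis 1 done')"],
        [sys.executable, "-c", "print('analysis 2 done')"],
    ]
    for command, code in run_batch(jobs):
        print("exit code:", code)
```

Recording the return code of every job makes it possible to see afterwards which analyses aborted without having been present when they ran.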

μWaveLab has the capability of determining port scattering parameters, radiation plot parameters in
any direction and field quantities at predetermined locations at all of the selected analysis frequencies.
The locations for field quantities are defined when the geometric model is created and are on surfaces of
volumes. This differs from HFSS, in which the port scattering parameters and the radiation and field quantities
are known everywhere but for a single frequency run only; only the scattering parameters are generated for a
multiple frequency sweep. After resonances are found from the port scattering parameters, a single frequency
analysis at the resonance frequencies is done to determine field distributions and radiation plots.
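
Locating candidate resonances in a frequency sweep amounts to finding dips in the reflection coefficient. The sketch below shows one simple way to do this on tabulated S11 data in dB. The 3 dB depth criterion, the function name and the sample data are assumptions made for the example; the data are synthetic and are not results from this study.

```python
def find_resonances(freqs_ghz, s11_db, depth_db=3.0):
    """Return (frequency, S11) pairs at local minima of S11 that lie at
    least depth_db below the highest S11 value in the sweep."""
    if len(freqs_ghz) != len(s11_db):
        raise ValueError("frequency and S11 lists must have equal length")
    if not s11_db:
        return []
    threshold = max(s11_db) - depth_db
    dips = []
    for i in range(1, len(s11_db) - 1):
        is_local_min = s11_db[i] < s11_db[i - 1] and s11_db[i] <= s11_db[i + 1]
        if is_local_min and s11_db[i] <= threshold:
            dips.append((freqs_ghz[i], s11_db[i]))
    return dips


if __name__ == "__main__":
    # Synthetic sweep with a single dip, for illustration only.
    freqs = [3.0, 3.2, 3.4, 3.6, 3.8, 4.0]
    s11 = [-0.5, -1.0, -12.0, -2.0, -0.8, -0.6]
    print(find_resonances(freqs, s11))
```

Each frequency returned would then be the subject of a separate single frequency run to obtain the field distributions and radiation plots.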

μWaveLab has the capability of solving for the eigenvalues (resonance frequencies) of a closed cavity. This
capability has been used on two models, the actual geometry of the Celestron-8 telescope and a simplified
geometry of the sensor. The assumption that all outer walls of the geometry, including the aperture, are
conductors is necessary for the eigenvalue problem. The first resonance frequency for the Celestron-8 was
found to be 0.42218 GHz. This does not agree with results using TEMAC3D, where the first resonance is
at approximately 1 GHz. The simplified geometry of the sensor cross-section is taken to be only the large,
central, cylindrical region. Many of the resonance frequencies for the sensor are close to those found using
TEMAC3D, but there are at least as many additional frequencies found using μWaveLab. The purpose of
this exercise was to see if the eigenvalue solver could be used to predict resonances of the large sensors
presented here. In fact, calculation of a few of the lower frequency resonances assuming purely cylindrical
cavities seems to be more effective. It was concluded that the eigenvalue solver is not a real asset for these
types of problems.

Two models for Celestron-8 simulations were created, one with a probe excitation located in the
large cylindrical region of the cavity, and the other with waveguide excitation at the eyepiece location. The
model for the Celestron-8 is the same as that used in HP HFSS and simulates tests at Phillips Lab [3]. The
goal of the modeling is to take advantage of μWaveLab’s ability to use symmetry with radiation as well as
its quadratic element meshing to reduce the size of the problem. After performing a frequency sweep from
1 to 1.5 GHz, a resonance in the correct range is observed, and the general shape of S11 versus frequency
(Figure 21) agrees with results from HP HFSS reported in [3].

Figure 21: Reflection coefficient for the Celestron-8 model.

The radiated power pattern for 1.35 GHz is shown in Figure 22. Even though this plot is for a frequency
where the telescope is not resonant, the values are similar to the results reported for HP HFSS [1][8]. Figure
22 can also be compared with the results from TEMAC3D of Figure 4 at 1.45 GHz. Again, the frequencies
are not exactly the same, but results are similar. Satisfaction with the capabilities of μWaveLab and the
validity of this model is high.

The Celestron-8 was also modeled with a circular waveguide excitation located at the eyepiece. This
effectively creates a model for frequencies above the cutoff frequency for this waveguide, 4.39 GHz. Attempts
at running the analysis failed due to a meshing problem that remained after several attempts at fixing the
problem. The models contained approximately 20,000 elements and were likely to fail during the analysis
due to lack of resources anyway. Attention was diverted to developing a full model of the satellite sensor.

Figure 22: Power radiated for the Celestron-8 model.
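
The cutoff frequency quoted above for the circular waveguide excitation can be checked with the standard expression for the dominant TE11 mode of an air-filled circular guide. The waveguide radius is not stated in this section; the 20 mm value below is an assumption chosen because it gives a cutoff near the quoted 4.39 GHz, and the function name is illustrative.

```python
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s
X11_PRIME = 1.8412  # first root of the Bessel-function derivative J1'(x)


def circular_te11_cutoff_hz(radius_m):
    """Cutoff frequency of the dominant TE11 mode of an air-filled
    circular waveguide of the given inner radius."""
    return X11_PRIME * C0 / (2.0 * math.pi * radius_m)


if __name__ == "__main__":
    radius = 0.020  # assumed radius (hypothetical), metres
    print(f"TE11 cutoff: {circular_te11_cutoff_hz(radius) / 1e9:.2f} GHz")
```

Because the cutoff scales inversely with the radius, a port that must pass lower frequencies needs a proportionally larger opening.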

Two models for the sensor simulations were created: one was a simplified model which did not
contain the outer toroidal sections, the other was similar to the model created for TEMAC3D [10]. An
added benefit to this approach is that the effects of the outer toroidal sections can be evaluated. The sensor
was modeled so that the resonances at 3.58 GHz and 6.47 GHz that were found using TEMAC3D [1] could
be investigated. Both H parallel and E parallel planes of symmetry⁷ (the plane of symmetry for this model
is the yz plane) were used with the appropriate rectangular waveguide excitations at the location of one of
the thermopile detectors in the sensor. A ground plane at the location of the satellite was also added. Figure
23 shows the geometry of the simplified sensor model. The layers to the right of the figure are, from left
to right, Germanium filter, air and Mylar. Each layer is 3 mm thick. The model is a body of rotation as
opposed to the detailed model used for TEMAC3D [1], and the ground plane is added at the right of the
figure. Also notice that the outer toroidal cavities modeled in [1] are not present in this μWaveLab model.

Electric field contours, port scattering parameters and radiation plots were generated for H parallel and
E parallel symmetry models. For H parallel symmetry, the electric field polarization is x directed. The
port scattering parameter, S11, is shown in Figure 24. The resonance frequency of 3.42 GHz (TEMAC3D
showed a resonance at 3.58 GHz [1]) was then investigated further by producing electric field contours at
the predetermined surfaces shown in Figure 25. Comparisons with the field line charts shown in [11] can be
made. The field intensity contours as presented by μWaveLab are orthogonal to the field line representations
shown in [11]. In the field line representation, the closely spaced lines represent areas of high field intensity
corresponding to the lightly shaded areas in the μWaveLab output. The view in the center of the large cavity
in Figure 25 clearly shows that this resonance has a TE11 type mode structure in the xy planes. These field
plots can also be compared to the results from TEMAC3D. The excitation source for those results is in the
xz plane so that the y polarized results from TEMAC3D must be compared to these. The left view of Figure
25 is comparable to Figures 12 and 14 (y polarizations), and the right view of Figure 25 is comparable to
Figure 10 (y polarization).

⁷This is μWaveLab terminology and is used for consistency with that software package. The standard terminology for an
“H parallel” plane is a “perfect electric conductor” plane, and an “E parallel” plane is a “perfect magnetic conductor” plane.

Figure 23: Model of the sensor used in μWaveLab.

Figure 26 shows the radiation plot for 3.43 GHz on the left. This plot for x polarized excitation corresponds
to the plot for y polarized excitation from TEMAC3D on the right. As such, the φ = 0° plot for the
μWaveLab results corresponds to the φ = 90° plot for the TEMAC3D results, and vice versa. The effect of
the ground plane modeled in μWaveLab is clearly shown. Slight reductions in gain at the angles approaching
the ground plane are comparable to those shown for the TEMAC3D results. Also note that the μWaveLab
model is a simplified model as described above.
Figure 24: Scattering parameters for the H-parallel sensor model.


Figure 25: Electric field intensity - cutaway view and view at the center of the large cavity.
Figure 26: μWaveLab (3.43 GHz) and TEMAC3D (3.58 GHz) radiation results.

The E parallel model is essentially the same as the H parallel model with the exception that the electric
field excitation is in the y direction. This polarization is analogous to the x polarization of the TEMAC3D
study [1]. The port scattering parameter, S11, as a function of frequency is shown in Figure 27. This time,
a resonance at 3.71 GHz is observed. The electric field contour plots for this type of symmetry are shown
in Figures 28 and 29. Again, these need to be compared to the field line plots of [11] in the same manner
as before. Comparisons can also be made with the results from TEMAC3D. Figures 28 and 29 compare to
Figures 12 (y polarization) and 14, and 10 (x polarizations) respectively.

The radiation plot for this symmetry is shown in Figure 30 on the left compared to the x polarization plot
of the TEMAC3D results on the right. The φ = 0° plot of the μWaveLab results corresponds to the φ = 90°
plot of the TEMAC3D results, and vice versa. Note that the μWaveLab radiation plots are normalized so
that the maxima are all 0 dB. Therefore, comparisons can only be made in the shape of the curves as opposed
to absolute magnitudes.
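
The normalization mentioned above can be illustrated with a short sketch that converts linear power samples to dB relative to the pattern maximum. The function name and the sample values are illustrative assumptions. After this step any overall scale factor is lost, which is why only the shapes of the curves can be compared.

```python
import math


def normalize_pattern_db(power_linear):
    """Convert linear power samples to dB relative to the pattern maximum,
    so that the largest sample maps to 0 dB."""
    peak = max(power_linear)
    if peak <= 0.0:
        raise ValueError("pattern must contain at least one positive sample")
    return [
        10.0 * math.log10(p / peak) if p > 0.0 else float("-inf")
        for p in power_linear
    ]


if __name__ == "__main__":
    pattern = [1.0, 2.0, 4.0, 2.0]       # arbitrary illustrative samples
    scaled = [10.0 * p for p in pattern]  # same shape, different gain
    print(normalize_pattern_db(pattern))
    print(normalize_pattern_db(scaled))   # identical after normalization
```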

Another model with smaller cells and closer radiation boundaries (the rule of thumb is that the radiation
boundaries must be λ/4 from the radiators) was created for analysis in the 6 GHz range. All other aspects
of the 6 GHz model are the same as for the 3 GHz model. Port scattering parameters are shown in Figure
31. Once again the resonance at 6.58 GHz compares well with the TEMAC3D resonance at 6.47 GHz. The
field contours for this symmetry are presented in Figure 32 and can be compared to Figures 13 and 15, and
11 (all y polarizations) from the TEMAC3D results. Keep in mind that the μWaveLab model is a simplified
body of revolution model and, as such, is not the same as the TEMAC3D model.

Figure 31: Scattering parameter for the 6 GHz H-parallel sensor model.
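
The λ/4 rule of thumb for the radiation boundaries explains why the 6 GHz model can use closer boundaries than the 3 GHz model. A minimal sketch of the arithmetic follows; it assumes the free-space wavelength is the relevant one, and the function name is illustrative.

```python
C0 = 299_792_458.0  # speed of light in vacuum, m/s


def min_boundary_distance_m(freq_hz):
    """Quarter of the free-space wavelength at the given frequency."""
    return C0 / freq_hz / 4.0


if __name__ == "__main__":
    for f_ghz in (3.0, 6.0):
        d_mm = min_boundary_distance_m(f_ghz * 1e9) * 1e3
        print(f"{f_ghz:.0f} GHz: lambda/4 = {d_mm:.1f} mm")
```

Doubling the frequency halves the required spacing, from roughly 25 mm at 3 GHz to roughly 12.5 mm at 6 GHz, which shrinks the radiation region that must be meshed.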

Figure 33 compares the radiation plot for the sensor model with H parallel symmetry at 6.58 GHz on the
left with the radiation plot from TEMAC3D on the right. The same rules as presented above for comparing
the radiation plots in the 3 GHz range apply here. Notice that there is not as good agreement as was
observed for the low frequency radiation plots. This can be explained by the fact that the μWaveLab model
is much simpler than the TEMAC3D model. The differences in the model results would naturally be more
pronounced at the higher frequencies.

Port scattering parameters for the 6 GHz E parallel model are shown in Figure 34. The resonance at
6.59 GHz is very close to that of 6.47 GHz predicted by the TEMAC3D model. A field intensity plot for
the 6.59 GHz resonance is shown in Figure 35. This can be compared to Figure 13 (x polarization) for the
TEMAC3D results. The radiation plot is shown in Figure 36 on the left compared with the TEMAC3D
results shown on the right. Once again, the same rules as presented above for comparing the radiation plots
apply here. As for the H parallel symmetry, the radiation plot results do not compare well because of the
differences in the models.
Figure 34: Scattering parameters for the 6 GHz E-parallel model.


Figure 35: Electric field intensity at 6.59 GHz.
Figure 36: μWaveLab (6.59 GHz) and TEMAC3D (6.47 GHz) radiation results.

The detailed models of the sensor similar to those reported in [1] were created so that the results
of the analysis could be more reasonably compared to results obtained using other methods of analysis.
However, this process was not entirely successful. At times the requirements for computer disk space were
too great. Most of the problems that were not related to availability of disk space were with the 3-5 GHz
E-parallel symmetry with a mesh containing 10,024 elements. The initial solution of the frequency sweep
led to positive values (in dB) for S11. This result, of course, was suspect not only due to the gain, but also
because no resonances similar to the TEMAC3D results were found. Thinking that this result may be due
to not modeling the loss in the germanium, its conductivity was incorporated into the model. The results of
this model are presented below.

To better examine the resonance in the 6.5 GHz region, yet another model was generated for a frequency
sweep between 4 and 6 GHz. This model surprisingly had fewer elements (7,903) than the previous one. This
may be due to the different order in which the regions were meshed. If the radiating region was not meshed
first, the computer would freeze up while attempting to mesh this region. The reason for this is not known.
Therefore, generating a successful mesh required meshing the radiation region first. This model
aborted in its solutions due to an error that Ansoft claimed to be a bug. Two viable work-arounds were
recommended by Ansoft’s technical support. The first was to try different element sizes. Several attempts
at this produced the same error. The second method was to attempt meshing the cylindrical surface first
with a 2-D mesh as a seed for the 3-D mesh, and to delete the 2-D mesh after the 3-D mesh was complete.
This also failed to produce results and the same error occurred.

Figure 37: View of the port used for the full sensor model.

At this point it became clear that one of the problems with the first model may have been that the
elements in the Germanium were too large given its high dielectric constant of 14. A third mesh was
generated in order to give smaller mesh elements in the Germanium regions. This mesh contained 32,400
elements and would certainly have failed due to lack of computing resources. However, the same error that
occurred in the second mesh occurred while attempting to analyze this mesh, and again all work-arounds
recommended by Ansoft were unsuccessful.
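
The need for smaller elements in the Germanium follows from the shorter wavelength inside a high-permittivity material. The sketch below estimates that wavelength and a corresponding maximum element size, using the dielectric constant of 14 quoted above and neglecting loss. The choice of ten elements per wavelength is an assumed rule of thumb for the example, not a μWaveLab requirement, and the function names are illustrative.

```python
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def wavelength_in_medium_m(freq_hz, rel_permittivity):
    """Wavelength in a lossless, non-magnetic dielectric."""
    return C0 / (freq_hz * math.sqrt(rel_permittivity))


def max_element_size_m(freq_hz, rel_permittivity, elements_per_wavelength=10):
    """Largest element edge that still gives the requested number of
    elements per wavelength in the material (assumed rule of thumb)."""
    return wavelength_in_medium_m(freq_hz, rel_permittivity) / elements_per_wavelength


if __name__ == "__main__":
    f = 6.5e9
    for name, eps_r in (("air", 1.0), ("Germanium", 14.0)):
        lam_mm = wavelength_in_medium_m(f, eps_r) * 1e3
        edge_mm = max_element_size_m(f, eps_r) * 1e3
        print(f"{name}: wavelength {lam_mm:.1f} mm, max element edge {edge_mm:.2f} mm")
```

With these assumptions the wavelength in the Germanium is shorter than in air by a factor of √14, about 3.7, so elements sized for the air regions are far too coarse there.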

The resulting successful geometric models that were created took advantage of half symmetry with
either an E parallel or H parallel plane. In these models the germanium filter was modeled as in [1], which
led to some difficulty in exciting the model. A port is the only means of exciting the cavity, so a waveguide
must be used to do so. The waveguide was modeled as a circular wedge, that is, the cross-section of the
waveguide was a portion of the face of the Mylar substrate. Figure 37 shows the port as the left triangle with
the plane of symmetry to the top. This allowed for the solution of the lower frequency model. In summary,
this sensor model differs from that of the previous section in that the outer toroidal regions and the filter
regions are modeled as in [1], not as a body of revolution as in the simplified model. A ground plane was
also added at the plane of the lens, not at the location of the satellite as in the simplified model.

Port scattering parameters were generated for H parallel symmetry and are shown in Figure 38. The
resonance at 6.4 GHz was investigated further. Field plots are shown in Figure 39. Comparison of these
plots to Figures 11 and 13 (y polarizations) shows remarkable similarity. Again, the TE011 mode is clearly
identifiable.

For the E parallel symmetry model, Figures 40 and 41 show S parameter results for the 3-5 GHz models
without and with loss in the Germanium regions. Clearly, modeling of loss is important for this sensor in the
FEM program. The scattering parameter plot for the 6-7 GHz model is shown in Figure 42. The resonance
at 6.5 GHz was investigated further, yielding the radiation plot shown in Figure 43. Compare this plot with
Figure 36. Correlation is better than for the simplified model presented above.

Figure 38: Scattering parameters for the 6 - 7 GHz model.

Figure 39: Electric field magnitude for the xy plane 1/3 below the lens, and the yz plane - 6.47 GHz.
Figure 40: S11 for the sensor model with no loss.

Field contour plots are shown in Figure 44. Notice that the fields are capable of entering the lower toroidal
cavity for this model. The field distribution in the central cavity can be compared to the TEMAC3D results
of Figures 11 and 15 for the y polarization. The mode suggested is once again TE011.

Figure 41: S11 for the sensor model with loss 3-5 GHz.
Figure 42: S11 for the sensor model with loss 6-7 GHz.
Figure 44: Field contours at 6.5 GHz with (a) lens removed, (b) lens and lower toroidal cavity removed, (c)
lens, lower and upper toroidal cavities and 1/3 of central cavity removed.
Summary, Conclusions and Recommendations

The TSAR FDTD solver relies on BRL-CAD, Anastasia and Image for pre-processing, and, in this
study, Mathcad and Axum for post-processing. Unfortunately, problems with installation of the entire TSAR
software suite were insurmountable, driving the use of a modified version of the TSAR FDTD solver from
Sandia Laboratory. This version also had problems with a radiation model of both the Celestron-8 and the
sensor. TSAR runs for the scattering problems with symmetry and a ground plane were, however, successful
as validated by TEMAC3D.

Though the results of μWaveLab for the most part have been accurate, the program has some drawbacks.
First, the bugs which prevented analysis on many of the models are a constant problem. The type of error
which occurred most was a problem with the mesh near cylindrical walls. As cylindrical walls are pervasive
in most of these problems, the error surfaces often. Next is the problem of excitation. The only way to excite
the cavity is by means of a waveguide of some kind. This means first that the point of excitation must occur
along a metal boundary of the model and second that the dimensions of the guide must permit passage of
the frequencies being considered. This especially posed a problem at the lower frequencies, as the maximum
dimension available to place a waveguide in was barely large enough to fit a port valid at that frequency.
This may be one reason the 3-5 GHz models performed so poorly. Finally, large computational
resources are still necessary with FEM because of the need to solve a large matrix. For n unknowns (field
quantities in the problem space), this results in a memory requirement of n^2, whereas for the FDTD method,
the memory requirement is n.
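
The contrast in memory scaling can be made concrete with a rough estimate. The sketch below takes the FEM requirement as n^2 stored values (a dense matrix, which is an assumption; sparse storage would need less) and the FDTD requirement as n stored values. The bytes per value and the function names are also assumptions made for illustration.

```python
def fem_dense_memory_bytes(n_unknowns, bytes_per_value=16):
    """Memory for a dense n-by-n system matrix (assumed complex double)."""
    return n_unknowns ** 2 * bytes_per_value


def fdtd_memory_bytes(n_unknowns, bytes_per_value=4):
    """Memory for n stored field values (assumed single precision)."""
    return n_unknowns * bytes_per_value


if __name__ == "__main__":
    for n in (10_000, 100_000):
        fem_gb = fem_dense_memory_bytes(n) / 1e9
        fdtd_mb = fdtd_memory_bytes(n) / 1e6
        print(f"n = {n}: FEM (dense) {fem_gb:.1f} GB, FDTD {fdtd_mb:.2f} MB")
```

Under these assumptions a tenfold increase in the number of unknowns raises the FEM estimate by a factor of one hundred but the FDTD estimate by only a factor of ten.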

Significant findings from μWaveLab were that it produced results consistent with TEMAC3D. The use
of symmetry and second order finite elements allowed analysis at frequencies much higher than was achieved
with HP HFSS. Pre- and most post-processing can be accomplished with μWaveLab. The HFSS capability
of viewing fields on any plane in the problem space was not available with μWaveLab. This capability is
an asset because the locations and types of fields of interest are in general not known a priori. If this capability
does not exist, the modeler must rely on experience to identify the likely locations and types of fields to be
viewed. If that does not work, trial and error is the only alternative.

Based on prior studies cited and the results of this report, the conclusions are:


1. In general, the FDTD codes are the most computationally efficient for large open problems. Both
TEMAC3D and TSAR run on UNIX platforms but could conceivably be ported to a PC running
LINUX. The disadvantage for both codes is that they are tied to BRL-CAD for geometric modeling
and to Anastasia and Image for mesh generation and viewing. While BRL-CAD has been ported to
AIX and resides on an IBM RS6000 at the University of Denver, Anastasia and Image have not been
successfully ported to AIX (the attempt has been made) or to LINUX (yet to be tried). Anastasia
and Image must still be used on a SUN platform at Phillips Laboratory. Both FDTD codes require
external programs for FFTs and graphical presentation of data.

2.  TSAR  is  required  for  large  problems  with  symmetry  or  if  a  ground  plane  is  necessary.  Application  of 
symmetry  conditions  reduces  required  computational  resources  and  allows  the  high  frequency  limit  of 
the  simulation  to  be  significantly  extended.  However,  even  though  TSAR  is  capable  of  doing  radiation 
problems,  this  feature  does  not  work  for  the  version  of  TSAR  that  has  been  installed  at  the  University 
of  Denver.  A  port  to  LINUX  of  the  full  TSAR  Suite  should  be  attempted. 

3.  TEMAC3D  is  better  than  TSAR  for  radiation  problems  which  do  not  have  to  implement  symmetry 
or  a  ground  plane.  Even  though  each  simulation  is  done  for  only  one  frequency,  the  use  of  a  separate 
program  to  generate  any  radiation  pattern  desired  is  a  distinct  advantage  over  TSAR  which  can  have 
more  than  one  frequency  identified  for  the  problem  but  with  a  limited  number  of  radiation  patterns. 
Desired  radiation  patterns  are,  in  general,  not  known  a  priori.  Both  TSAR  and  TEMAC3D  require 
definition  of  planes  for  field  data  prior  to  running  the  problem. 

4. Ansoft µWaveLab, limited to UNIX platforms and destined to be replaced by the Ansoft High Frequency
Structure Simulator (HFSS) Version 5, is well suited to closed network parameter problems
and single frequency radiation problems. As with TSAR and TEMAC3D, its field visualization capabilities
are limited due to a priori definition of surfaces. Future use of Ansoft HFSS will overcome this
deficiency. (HFSS will also be available for the PC.) Radiation and field plots can be generated in any
plane desired. The high computational expense of doing frequency sweeps with large open problems
puts it at a distinct disadvantage. There is no capability of doing plane wave scattering problems.

The  recommendations  are:  (1)  In  the  absence  of  a  comprehensive  test  and  measurement  program,  various 
different  numerical  methods  should  be  used  for  verification  of  results.  (2)  An  FEM  code  such  as  Ansoft  HFSS 
should  be  used  for  the  closed  optical  train  section  of  larger  optical  systems,  and  codes  such  as  TEMAC3D 
and  TSAR  should  be  used  for  analysis  of  the  open  optical  regions.  (3)  An  attempt  to  port  BRL-CAD  and 
the TSAR Software Suite to LINUX should be made. If successful, the extensive post-processing
capabilities  contained  in  the  TSAR  Software  Suite  could  then  be  evaluated. 
ABSTRACT

Gas  turbine  engine  cycle  calculations  are  used  to  estimate  contrail  factors  in  a  range  of  flight  environments  at  a  range 
of  power  settings,  for  generic  low-bypass  and  high-bypass  turbofan  engines.  It  is  argued  that  the  proper  frame  of 
reference  for  performing  this  calculation  is  that  of  an  air  parcel  moving  with  the  ambient  winds.  The  most  critical 
part  of  the  contrail  factor  calculation  is  to  estimate  the  change  in  total  enthalpy  such  a  parcel  experiences  as  it  is 
ingested  by  a  gas  turbine  engine,  compressed,  heated,  expanded,  and  exhausted.  Using  this  frame  of  reference,  it  is 
seen  that,  contrary  to  assumptions  made  in  current  methods  of  contrail  forecasting,  the  contrail  factor  is  not  constant 
for  a  given  engine  type.  It  varies  even  for  the  same  engine  at  different  power  settings  and  flight  conditions.  The 
range  of  conditions  investigated  included  flight  at  altitudes  between  25000  and  50000  ft  in  a  standard  atmosphere, 
Mach  numbers  ranging  from  0.4  to  0.9,  and  power  settings  from  idle  to  military.  The  contrail  factor  ranges  from 
0.030 to 0.053 (g/kg)/°C for the low-bypass engine, and from 0.038 to 0.090 (g/kg)/°C for the high-bypass engine,
depending  on  flight  environment,  Mach  number,  and  power  settings.  Contrail  factors  are  generally  higher  at  lower 
power  settings  at  a  given  Mach  number,  and  higher  at  higher  Mach  numbers  at  a  given  power  setting.  Contrail 
factors  tend  to  be  higher  at  higher  altitudes  in  general.  In  order  to  evaluate  the  contrail  factor  calculations  based  on 
cycle  calculations,  a  set  of  contrail  factors  is  diagnosed  from  a  detailed  set  of  opportunistic  observations  of  contrails 
observed  in  near-threshold  conditions  for  formation.  The  aircraft  involved  were  mainly  civil  transport  aircraft,  and 
were  powered  with  a  variety  of  engine  types.  The  range  of  contrail  factors  inferred  from  these  observations  is 
compared  to  the  range  of  those  computed  from  cycle  calculations  for  different  conditions.  Reasonable  agreement  is 
found. 
INTRODUCTION


Recent  reviews  (e.g.  Schumann,  1996;  Schrader,  1997)  have  noted  that  aircraft  condensation  trails,  or  contrails,  have 
been observed since the early days of aviation. Combat requirements during the World War II era led to quantitative
efforts  to  forecast  the  conditions  in  which  contrails  will  form  (Schumann,  1996).  Detailed  discussions  of  contrail 
formation  forecasting  by  U.S.  Air  Force  personnel  and  contractors  appeared  in  the  open  refereed  literature  beginning 
in  the  1950’s  (e.g.  Appleman,  1953;  Downie  and  Silverman,  1957;  Pilie  and  Jiusto,  1958). 

Contrails  are  an  example  of  mixing  cloud  formation,  a  mode  of  cloud  formation  that  has  been  studied  and  understood 
in a modern sense since early in this century. Despite the straightforward theory behind contrail formation, and
despite the strong foundation laid by the early publications on this subject, to this day there still appear misguided
treatments of contrail formation, as in Hanson and Hanson (1995). This report has three objectives: (1) to discuss some
of the misconceptions appearing in recent treatments of the theory of contrail formation, (2) to develop a conceptual
framework for analyzing contrail formation thermodynamically in a way that accounts for aspects of the problem not
currently considered explicitly in Air Force operational forecasting, and (3) to compare predictions made
using  this  framework  to  observations  of  contrail  formation  obtained  during  a  recent  Air  Force  Research  Laboratory 
field  program. 

CONTRAIL  FORMATION 

Condensation  trails  form  behind  aircraft  when  they  fly  in  sufficiently  cold  and  humid  air.  These  trails  are  an  example 
of  a  cloud  formed  by  isobaric  mixing  of  warmer  air  that,  while  not  saturated,  contains  a  relatively  high  mixing  ratio 
of  water  vapor,  with  colder  air  that  also  is  not  saturated  and  contains  a  relatively  low  mixing  ratio  of  water  vapor. 
Under  suitable  conditions  it  is  possible  for  saturation  to  be  reached  in  some  mixtures  of  these  two  types  of  air.  The 
process of cloud formation by mixing has been understood at least since the beginning of modern meteorology in this
century  and  is  described  in  detail  in  many  of  the  classic  modern  texts  (e.g.  Geddes,  1921;  Petterssen,  1956;  Haltiner 
and  Martin,  1957;  Iribarne  and  Godson,  1981;  Bohren  and  Albrecht,  1998). 

It  is  important  to  note  that  when  two  parcels  with  different  properties  mix  isobarically,  it  is  vapor  mixing  ratio  and 
enthalpy  that  are  mixed  in  proportion  to  the  mass  of  air  of  each  of  the  two  original  types  in  the  mixture.  The  state  of 
the  mixture  is  determined  by  this  mixing  of  vapor  and  enthalpy.  In  most  meteorological  treatments  of  contrail 
formation  (e.g.  Appleman,  1953;  Schrader,  1997),  the  mixing  is  described  in  terms  of  vapor  mixing  ratio  and 
temperature.  However,  temperature  is  not  a  function  of  state,  but  enthalpy  is  a  function  of  state.  (See,  e.g.,  Bohren 
and  Albrecht,  1998.)  Plume  mixing  described  in  terms  of  mixing  of  temperature  is  a  reasonable  approximation  in  a 
meteorological context because enthalpy is given by CpT, where Cp is the specific heat of air at constant pressure.
This specific heat is effectively constant and independent of temperature within the range of temperatures
encountered in the troposphere. If Cp is constant, then enthalpy is directly proportional to temperature and mixing
can  be  described  in  terms  of  mixing  of  vapor  mixing  ratio  and  temperature. 

In  the  context  of  contrail  formation,  however,  mixing  clouds  form  by  mixing  between  parcels  with  temperatures 
differing  by  more  than  1000  R  in  some  instances.  The  specific  heat  of  air,  Cp,  varies  significantly  over  this  range  of 
temperatures,  as  shown  in  Figure  1.  This  variation  has  been  incorrectly  ignored  in  many  treatments  of  contrail 
formation,  including  a  recent  treatment  by  this  author!  (Detwiler,  1996).  It  can  be  seen  in  Figure  1  that  Cp  varies  by 
about  6%  over  the  range  of  temperatures  considered,  which  is  typical  of  contrail  formation  circumstances.  This 
means that the traditional contrail factor, the ratio of the vapor mixing ratio difference to the temperature difference
between  exhaust  and  environment,  expressed  as  (g/kg)/°C,  is  not  constant  over  the  entire  range  of  possible  exhaust 
temperatures. 


Figure 1. The variation of the specific heat of air at constant pressure, Cp, with temperature T (R).


The  process  of  contrail  formation  is  illustrated  below  with  a  mixing  diagram,  derived  from  Figure  1  in  Detwiler 
(1996),  shown  here  as  Figure  2.  In  Figure  2  mixing  is  shown  in  the  framework  of  vapor  mixing  ratio  and  enthalpy. 
Isobaric  mixing  at  300  mb  ambient  total  pressure  is  represented.  Air  characterized  by  mixing  ratio  and  enthalpy 
represented  on  the  diagram  by  Point  A,  mixing  with  air  represented  by  Point  B,  will  produce  mixtures  whose  mixing 
ratio  and  enthalpy  lie  along  a  straight  line  connecting  the  two  points.  This  line  will  be  called  the  mixing  line.  (If 
temperature were used on the horizontal axis, then the line would have slight curvature due to the variation of Cp
with  temperature.  The  mixing  line  on  a  mixing  ratio  versus  temperature  diagram  will  be  essentially  straight  at  the 
lower  end,  since  Cp  is  essentially  constant  over  the  meteorological  temperature  range.)  Its  slope  at  any  point  along  it 
is  the  traditional  contrail  factor  if  the  horizontal  axis  is  expressed  in  terms  of  temperature.  Mixtures  that  are  mostly 
A  will  lie  near  A  on  the  diagram.  Mixtures  that  are  mostly  B  will  lie  near  B.  If  the  axes’  scales  on  the  diagram  are 
linear,  then  the  distance  from  the  end  points  is  inversely  proportional  to  the  proportion  of  that  type  of  air  in  the 
mixture. 
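
The straight mixing line described above can be sketched numerically. The following is a minimal illustration, not the author's procedure: the end-point values for Points A and B are hypothetical, and the function name `mix` is introduced here only for the example. It shows that mass-weighted mixing of enthalpy and vapor mixing ratio always lands on the straight line joining the two end points.

```python
def mix(point_a, point_b, frac_a):
    """Mass-weighted mixture of two air types.

    point_a, point_b: (enthalpy, vapor mixing ratio) of the two air types.
    frac_a: mass fraction of air A in the mixture, between 0 and 1.
    """
    if not 0.0 <= frac_a <= 1.0:
        raise ValueError("frac_a must lie in [0, 1]")
    h_a, w_a = point_a
    h_b, w_b = point_b
    h = frac_a * h_a + (1.0 - frac_a) * h_b
    w = frac_a * w_a + (1.0 - frac_a) * w_b
    return h, w


# Hypothetical end points: (enthalpy in BTU/lbm, mixing ratio in g/kg).
exhaust = (430.0, 25.0)   # Point A, assumed values
ambient = (95.0, 0.05)    # Point B, assumed values

# Slope of the mixing line in mixing ratio versus enthalpy space.
slope = (exhaust[1] - ambient[1]) / (exhaust[0] - ambient[0])

for frac in (0.0, 0.01, 0.1, 0.5, 1.0):
    h, w = mix(exhaust, ambient, frac)
    # Vertical offset from the straight line through A and B (zero up to rounding).
    offset = (w - ambient[1]) - slope * (h - ambient[0])
    print(f"fraction of A = {frac:4.2f}  h = {h:7.2f}  w = {w:7.3f}  offset = {offset:.1e}")
```

Mixtures with a small fraction of A sit close to B on the diagram, which is the region where contrail formation is examined.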
Figure 2. A diagram illustrating a hypothetical mixing line between air with vapor mixing ratio and enthalpy
characteristic  of  high  temperature  gas  turbine  exhaust,  and  air  with  a  mixing  ratio  and  enthalpy  characteristic  of  the 
upper  troposphere.  The  exhaust  has  a  temperature  of  973  K  (1751  R)  while  the  tropospheric  air  has  a  temperature  of 
221 K (398 R) and a relative humidity of 40%. The contrail factor in this example has a value of 0.0697
(g/kg)/(BTU/lbm), which is equivalent to 0.030 (g/kg)/°C near ambient conditions. At this scale, the saturation
mixing  ratio  curves  with  respect  to  ice  and  liquid  water  are  indistinguishable.  The  saturation  mixing  ratios  as  a 
function  of  temperature  are  computed  using  the  Goff-Gratch  formulae  and  are  plotted  against  the  enthalpy  of  air  at 
that  temperature. 


Conditions  near  the  lower  end  of  the  mixing  line  are  shown  in  expanded  scale  in  Figure  3.  For  this  set  of  conditions, 
the  mixing  line  crosses  above  the  saturation  mixing  ratio  line  with  respect  to  liquid  water,  and  condensation  is 
expected. 
Figure 3. An expanded view of the lower end of the mixing line shown in Figure 2, indicating that liquid
condensation  will  occur  during  mixing  for  the  conditions  specified  above. 


As discussed by many authors, most recently by Schrader (1997) and Jensen et al (1998), contrail formation will
occur  when  the  mixing  line  crosses  the  saturation  mixing  ratio  line  for  liquid  water.  This  indicates  that  some 
mixtures  are  supersaturated  with  respect  to  liquid  water  and  condensation  on  suitable  exhaust  nuclei  will  occur.  The 
slope  of  the  mixing  line  for  conditions  near  the  ambient  atmospheric  temperatures,  where  condensation  actually 
occurs,  is  equal  to  the  traditional  contrail  factor.  This  slope  can  be  expressed  in  terms  of  (g/kg)/°C,  using  Figure  2,  if 
one  divides  enthalpy  in  the  range  of  meteorological  temperatures,  by  Cp  in  this  temperature  range,  using  appropriate 
units. 

Detwiler  (1996),  among  others,  shows  that  the  critical  temperature  for  contrail  formation,  the  highest  temperature  at 
which  contrails  will  form  at  a  given  pressure  level  and  relative  humidity,  is  the  temperature  at  which  the  slope  of  the 
curve  of  saturation  mixing  ratio  versus  temperature  is  just  equal  to  the  contrail  factor.  Since  the  standard 
meteorological value of Cp is valid at these temperatures, it is proper to multiply the slope of the mixing line in mixing
ratio-enthalpy space by Cp to find the slope in mixing ratio-temperature space.
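
The matching-slope condition can be illustrated with a short numerical sketch. This is not the calculation used in the report: the saturation vapor pressure here comes from the Bolton (1980) approximation over liquid water rather than the Goff-Gratch formulae, and it is extrapolated below its usual range, so the result is only indicative. All function names are hypothetical. The code finds, by bisection, the temperature at which the slope of the saturation mixing ratio curve equals a given contrail factor.

```python
import math


def sat_vapor_pressure_hpa(t_c):
    """Saturation vapor pressure over liquid water (hPa), Bolton approximation."""
    return 6.112 * math.exp(17.67 * t_c / (t_c + 243.5))


def sat_mixing_ratio_gkg(t_c, p_hpa):
    """Saturation mixing ratio (g/kg) at temperature t_c (deg C) and pressure p_hpa."""
    e_s = sat_vapor_pressure_hpa(t_c)
    return 622.0 * e_s / (p_hpa - e_s)


def sat_slope(t_c, p_hpa, dt=0.01):
    """Central-difference slope of the saturation curve, in (g/kg)/deg C."""
    upper = sat_mixing_ratio_gkg(t_c + dt, p_hpa)
    lower = sat_mixing_ratio_gkg(t_c - dt, p_hpa)
    return (upper - lower) / (2.0 * dt)


def matching_slope_temperature_c(contrail_factor, p_hpa, lo=-80.0, hi=-10.0, tol=1e-4):
    """Temperature (deg C) where the saturation-curve slope equals the contrail factor."""
    if not sat_slope(lo, p_hpa) < contrail_factor < sat_slope(hi, p_hpa):
        raise ValueError("contrail factor is outside the bracketed slope range")
    # The slope increases with temperature, so bisection converges.
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if sat_slope(mid, p_hpa) < contrail_factor:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


t_crit = matching_slope_temperature_c(0.030, 300.0)
print(f"slopes match near {t_crit:.1f} deg C at 300 mb")
```

A larger contrail factor moves the matching point to a warmer temperature, which is why the estimate of the contrail factor matters for forecasting.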
ESTIMATING CONTRAIL FACTOR from ENGINE CYCLE DATA


Previous  Work 

Appleman  (1953)  estimated  the  slope  of  the  mixing  line  by  taking  the  ratio  of  water  to  heat  produced  during 
combustion without considering any of the details of engine operation. Appleman estimated that for every lbm of
fuel burnt, 1.4 lbm of water, and 17974 BTU, were produced. The enthalpy of combustion is divided by Cp at typical
ambient  temperatures  to  estimate  the  temperature  change  due  to  combustion.  The  implicit  assumptions  in  this 
approach  are  that  all  of  the  water  and  heat  generated  during  combustion  is  contained  in  the  gases  leaving  the  exhaust 
nozzle,  and  that  Cp  is  constant.  While  Appleman  made  his  computation  using  data  for  a  fuel  used  during  World  War 
II,  Jiusto  and  Pilie  (1958)  in  the  same  manner  computed  a  contrail  factor  for  JP-4,  the  most  common  military  fuel  in 
use then and now (very close in properties to Jet-A, the most common civilian fuel then and now), arriving at a value
of  0.0295  (g/kg)/K.  A  detailed  discussion  of  this  procedure,  along  with  some  of  the  uncertainties  associated  with  it, 
is  presented  in  Detwiler  (1996). 
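
The fuel-property estimate described above can be written out as a short calculation. This is a sketch under stated assumptions: Cp is taken as 0.24 BTU/(lbm R), a commonly used value for air at ambient temperatures that is not quoted in the text, and the function name is hypothetical. The water and heat figures are the ones attributed to Appleman above. The mass of air into which the combustion products are diluted cancels, so it does not appear.

```python
CP_AIR_BTU_PER_LBM_R = 0.24   # assumed specific heat of air at ambient temperatures
RANKINE_PER_KELVIN = 9.0 / 5.0


def contrail_factor_from_fuel(water_lbm, heat_btu, cp=CP_AIR_BTU_PER_LBM_R):
    """Contrail factor in (g/kg)/K from water and heat released per lbm of fuel.

    Diluting the products into M lbm of air raises the mixing ratio by
    1000 * water_lbm / M (g/kg) and the temperature by heat_btu / (M * cp) (R),
    so M cancels in the ratio.
    """
    if heat_btu <= 0.0 or cp <= 0.0:
        raise ValueError("heat_btu and cp must be positive")
    per_rankine = 1000.0 * water_lbm * cp / heat_btu
    return per_rankine * RANKINE_PER_KELVIN


# Values attributed to Appleman (1953) in the text: 1.4 lbm water, 17974 BTU per lbm fuel.
cf_appleman = contrail_factor_from_fuel(1.4, 17974.0)
print(f"contrail factor from fuel properties: {cf_appleman:.4f} (g/kg)/K")
```

Because only fuel properties enter, this estimate is the same for every engine and flight condition, which is the limitation the rest of this section addresses.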

A  different  approach  to  estimating  the  contrail  factor  is  to  use  measurements  or  calculations,  based  on  the  aircraft 
engine  cycle,  of  the  water  generated  by  a  particular  rate  of  fuel  combustion,  and  divide  by  the  difference  in  enthalpy 
between  the  exhaust  gases  leaving  the  nozzle  and  the  surrounding  environment.  Peters  (1993)  obtained  engine 
manufacturer’s  data  on  several  engines  from  which  to  calculate  contrail  factors  in  this  manner.  These  engine  types 
included  the  turbojet,  and  low-  and  high-bypass  gas  turbine  engines.  The  data  included  tailpipe  moisture  and 
temperature,  for  a  wide  range  of  power  settings,  flight  Mach  numbers,  and  flight  levels.  He  does  not  say  so  explicitly, 
but  for  the  bypass  engines  it  appears  that  he  used  numbers  characteristic  of  the  engine  core  exhaust.  Other  critical 
details  not  described  in  his  report  include  the  ratio  of  water  created  to  fuel  consumed,  and  whether  the  exhaust 
temperatures used were static or total temperatures. He gives the following characteristic contrail factors for each
type  of  engine: 

non-bypass (turbojet) - 0.0360 (g/kg)/°C
low-bypass - 0.0400 (g/kg)/°C
high-bypass - 0.0490 (g/kg)/°C


Contrail factors can also be computed empirically from measured fuel flow mf, engine air flow me, ambient static
temperature Ta, and exhaust exit total (stagnation) temperature Teo. One form of the estimate is presented in Saatzer
(1995a) 
$$CF = \frac{1.3\,m_f}{m_e\,(T_{eo} - T_a)} \tag{1}$$


where “1.3” represents Saatzer’s assumption of the ratio of water mass generated per unit mass of fuel combusted.
(The fuel is assumed to be represented chemically as (CH2)n. One water molecule is produced for each CH2 group
oxidized. The ratio of molecular weights of water to CH2 is 18/14, which is ~1.3.) Other estimates of the ratio of
water produced per unit of JP-4 or Jet-A fuel combusted, based on actual measurements of fuel composition, are as
low as 1.22, as in Busen and Schumann (1995). The fuel flow mf multiplied by 1.3 is used to estimate the water
added to the exhaust by combustion. Fuel flow can be neglected with respect to the air flow me in the denominator
of (1) for all practical calculations, as the air flow is usually not known to better than ~2%, and typical maximum
fuel/air  ratios  are  3-4%.  The  engine  exit  total  temperature  Teo  is  a  measure  of  the  total  energy,  thermal  plus  directed 
kinetic  energy  of  motion,  created  by  combustion  and  added  to  the  engine  exhaust.  The  difference  (Teo  -  Ta)  is 
proportional  to  the  energy  added  by  combustion.  (In  fact,  Saatzer  assumes  that  Cp  is  independent  of  temperature  and 
therefore  that  (Teo  -  Ta)  is  exactly  proportional  to  the  enthalpy  of  combustion.) 
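
Equation (1) can be evaluated directly from engine measurements. The sketch below is illustrative only: the flow and temperature values are hypothetical, the function name is introduced for the example, and the factor of 1000 is an assumption made here to express the result in (g/kg)/K when both flows share the same mass units.

```python
def contrail_factor_eq1(fuel_flow, air_flow, t_exit_total_k, t_ambient_k,
                        water_per_fuel=1.3):
    """Contrail factor in (g/kg)/K following the form of equation (1).

    fuel_flow and air_flow must be in the same units (for example kg/s).
    Temperatures are in kelvin. Cp is treated as constant, as in equation (1).
    """
    delta_t = t_exit_total_k - t_ambient_k
    if delta_t <= 0.0 or air_flow <= 0.0:
        raise ValueError("need exit temperature above ambient and positive air flow")
    # 1000 converts kg of water per kg of air into g/kg.
    return 1000.0 * water_per_fuel * fuel_flow / (air_flow * delta_t)


# Hypothetical operating point: fuel/air ratio of 2%, 1000 K exit, 220 K ambient.
cf = contrail_factor_eq1(fuel_flow=0.5, air_flow=25.0,
                         t_exit_total_k=1000.0, t_ambient_k=220.0)
cf_low_water = contrail_factor_eq1(0.5, 25.0, 1000.0, 220.0, water_per_fuel=1.22)
print(f"water/fuel = 1.30: {cf:.4f} (g/kg)/K")
print(f"water/fuel = 1.22: {cf_low_water:.4f} (g/kg)/K")
```

The result scales linearly with the assumed water-to-fuel ratio, so replacing 1.3 with 1.22 lowers the estimate by about 6%.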

A  several-month  flight  test  program  involving  turbojet-powered  T-33’s  is  described  in  Saatzer  (1995a).  Contrail 
factors  were  computed  from  engine  measurements  using  equation  (1)  during  climb,  descent,  and  steady  level  flight. 
Contrail  factors  computed  for  the  T-33  ranged  from  0.025  to  0.039  (g/kg)/°C.  Factors  as  low  as  0.025  were  obtained 
during  engine  decelerations/descents,  values  near  0.032  in  steady  flight,  and  factors  as  high  as  0.039  during 
accelerations/climbs.  The  low  contrail  factors  during  decelerations  were  transient  and  due  to  the  fuel  flow 
decreasing  more  quickly  than  the  exhaust  temperature.  The  reverse  was  true  during  accelerations.  The  median  value 
of 0.032 (g/kg)/°C for steady flight is slightly higher than the value estimated from fuel properties, but lower than
the  value  suggested  by  Peters  (1993)  for  a  turbojet.  Good  agreement  was  obtained  between  contrail  factors  inferred 
from  the  observed  contrail  onset  environmental  conditions,  and  those  computed  from  aircraft  operating  parameters. 

Building  on  the  Previous  Work 

In  this  work,  engine  cycle  data  were  computed  for  two  engine  configurations  and  a  variety  of  flight  conditions, 
following  the  line  of  investigation  opened  by  Peters  (1993).  Contrail  factors  were  first  computed  using  a  variation 
on  (1),  as  described  in  Saatzer  (1995a).  It  is  presumed  that  Peters  performed  essentially  the  same  calculation, 
although  there  is  insufficient  detail  in  Peters  (1993)  to  verify  this.  One  minor  adjustment  was  that  the  constant  factor 
in  the  numerator  of  (1)  was  reduced  to  1.28,  as  justified  in  Detwiler  (1996). 

One  difference  between  the  present  work  and  the  previous  work  just  described  is  that  in  the  previous  work  the 
mixing  process  is  incorrectly  represented  with  a  straight  line  connecting  two  different  initial  mixtures  in  vapor 
mixing  ratio  versus  temperature  space.  As  reported  above,  isobaric  mixing  is  more  correctly  represented  with  such  a 
straight line in vapor mixing ratio versus enthalpy space. A computation of contrail factor in units of (g/kg)/°C,
equivalent in definition to the contrail factor of Appleman (1953), can be made by multiplying the slope of the mixing
line in mixing ratio - enthalpy space, (g/kg)/(BTU/lbm), by Cp (BTU/(lbm R)) and then by 9/5.

When  this  contrail  factor  estimate  is  based  on  the  ratio  of  water  to  heat  produced  during  combustion,  and  the  value  of 
Cp  employed  is  appropriate  for  upper  tropospheric  temperatures,  then  the  contrail  factor  found  is  appropriate  for 
estimating  the  threshold  temperature  for  contrail  formation  by  the  matching-slope  method,  equivalent  to  the 
procedures  of  Appleman  (1953),  Jiusto  and  Pilie  (1958),  Jensen  et  al  (1998),  and  others. 

Comparing  the  results  of  the  procedure  followed  by  Peters  (1993)  and  Saatzer  (1995a)  to  the  present  work,  the 
previously  computed  contrail  factors  are  too  low  due  to  an  underestimate  of  the  enthalpy  difference  between 
environment  and  exhaust.  The  hotter  the  exhaust,  the  more  the  contrail  factor  will  be  underestimated  if  variation  of 
Cp with temperature is ignored. For exhaust stagnation temperatures around 1750 R, the underestimate will be ~6%,
resulting  in  an  estimate  of  contrail  onset  temperature  ~0.6  K  too  low  (Detwiler,  1996). 

Further  refinements 

Engines  with  separate  exhaust  streams 

These  first  attempts  at  contrail  factor  estimation  were  subsequently  modified.  For  bypass  engines  with  separate 
unmixed  exhaust  streams,  all  of  the  water  generated  during  combustion  is  in  the  core  stream,  but  the  enthalpy 
generated  is  distributed  to  both  the  core  and  bypass  streams.  In  the  results  presented  below,  the  contrail  factor  for 
turbofan  cycles  with  separate  exhausts  is  computed  based  on  the  core  properties.  Thus  contrail  factors  for  the  core 
flow in turbofan cycles with separate exhausts represent a maximum possible contrail factor. Some mixing between
core  and  fan  stream  may  occur  before  contrail  formation  occurs  aft  of  the  engine.  If  mixing  occurs,  the  appropriate 
contrail  factor  should  be  somewhere  between  the  core  contrail  factor  and  a  contrail  factor  representing  completely 
mixed  bypass  and  core  streams.  Saatzer  (1995b)  attributes  the  higher  contrail  factors  for  high-bypass  turbofan 
engines  compared  to  low-bypass  engines,  deduced  during  flight  tests,  to  incomplete  mixing  of  the  separate  streams 
prior  to  contrail  formation. 

Propulsion  Efficiency 

An  important  assumption  in  the  computation  of  contrail  factor  from  fuel  properties  is  the  assumption  that  all  of  the 
enthalpy  added  to  the  engine  air  stream  during  combustion  is  present  in  the  exhaust  leaving  the  engine.  Detwiler 
(1996)  reviewed  the  debate  over  this  assumption.  Schumann  (1996),  quoting  earlier  work  by  Schmidt  in  Germany 
during World War II, points out that some of the energy generated during combustion must be used to propel the
aircraft. This energy is not present in the near-field engine exhaust plume but dispersed in the wake of the aircraft
outside  of  the  engine  exhaust  plume.  This  separated  energy  eventually  mixes  back  together,  but  not  for  many  aircraft 
widths  aft.  Contrail  formation  typically  occurs  well  before  these  separate  regions  mix  back  together. 

Using  a  computation  of  propulsion  efficiency  to  estimate  the  fraction  of  combustion  energy  in  the  aircraft  wake 
outside  of  the  exhaust  plumes,  and  computing  contrail  factors  using  the  Appleman  approach  but  discounting  this 
energy  from  the  exhaust,  Busen  and  Schumann  (1995),  Schumann  et  al  (1996),  Schumann  (1996),  and  Jensen  et  al 
(1998)  report  reasonable  agreement  between  estimated  and  observed  contrail  onset  temperatures.  These  authors,  in 
contrast  to  Saatzer,  attribute  the  higher  contrail  factors  for  high-bypass  compared  to  low-bypass  engines,  diagnosed 
from observations, to higher propulsion efficiency for higher bypass engines.

An  appropriate  frame  of  reference  for  contrail  factor  calculations 

After  trying  several  approaches  to  estimating  contrail  factors  based  on  engine  cycle  calculations,  and  observing  some 
inconsistencies,  it  was  decided  to  adopt  a  new  reference  frame  for  the  calculations  in  order  to  simplify  the 
interpretation  of  the  results  in  terms  of  propulsion  efficiency  and  separate  exhaust  streams.  Busen  and  Schumann 
(1995), and Saatzer (1995a), following Osterle (1956) and others, look at contrail formation in a reference frame
moving  with  the  engine.  This  is  convenient  in  some  respects,  as  total  temperature  and  pressure  measurements  (in  the 
case  of  real  engines),  and  total  temperature  and  pressure,  and  exit  velocity,  computations  (in  the  case  of  cycle 
calculations)  are  all  made  in  this  reference  frame.  However,  Figure  2  shows  that  contrail  formation  actually  occurs 
in  mixtures  that  are  almost  completely  environmental  air.  If  one  assumes  that  turbulent  mixing  of  heat  and 
momentum  proceed  at  the  same  rate,  then  this  newly-formed  contrail  region  should  be  moving  almost  with  the 
environmental  winds.  Almost  all  of  the  directed  kinetic  energy  in  the  exhaust  plume,  the  thrust  which  the  engine 
generates,  will  have  been  reduced  to  random  thermal  energy  by  the  time  contrail  formation  occurs.  Thus  the  natural 
frame  of  reference  for  computing  contrail  factors  is  the  one  of  the  environment  through  which  the  aircraft  is  moving. 

In  this  reference  frame  moving  with  the  environmental  winds,  one  can  imagine  a  parcel  of  air  lying  in  the  path  of  an 
advancing  jet  engine.  The  parcel  is  overtaken  by  the  engine,  drawn  into  it,  compressed,  heated,  has  enthalpy  injected 
by  combustion,  expands  through  the  turbine  and  nozzle,  is  exhausted  at  a  high  speed,  and  is  left  behind  mixing  with 
the  unaltered  environment  as  the  engine  moves  away. 

The  water  vapor  mixing  ratio  in  this  exhaust  air  is  estimated  as 

$$\frac{1.28\,m_f}{m_f + m_e} \tag{2}$$

The total enthalpy of this exhaust air is

$$H_{es} + \tfrac{1}{2}\,(V_e - V_p)^2 \tag{3}$$

where Hes is the static enthalpy of the exhaust at the exit plane, CpeTes; Cpe is the specific heat at constant
pressure  at  Tes,  and  the  second  term  in  (3)  is  the  kinetic  contribution  to  total  enthalpy,  in  the  reference  frame  of  the 
air  through  which  the  engine  is  moving.  It  is  assumed  that  this  exhaust  is  brought  to  rest  relative  to  the  air  around  it 
by  mixing  with  it,  so  that  this  total  enthalpy,  initially  the  sum  of  substantial  static  and  kinetic  parts,  is  converted  to  all 
static  enthalpy  at  the  point  where  contrail  formation  begins. 
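
Expressions (2) and (3) can be combined into a contrail factor in the reference frame of the air. The sketch below is an illustration under several assumptions that are not stated in the text: SI units are used, Ve is read as the exhaust speed relative to the engine and Vp as the aircraft speed, the ambient enthalpy is taken as the ambient Cp times the ambient temperature, and all numerical inputs and function names are hypothetical.

```python
def exhaust_mixing_ratio_gkg(fuel_flow, air_flow, water_per_fuel=1.28):
    """Water vapor added to the exhaust, expression (2), in g/kg."""
    return 1000.0 * water_per_fuel * fuel_flow / (fuel_flow + air_flow)


def exhaust_total_enthalpy(cp_exit, t_exit_static, v_exhaust, v_aircraft):
    """Total enthalpy of the exhaust in the frame of the air, expression (3), in J/kg."""
    static_part = cp_exit * t_exit_static
    kinetic_part = 0.5 * (v_exhaust - v_aircraft) ** 2
    return static_part + kinetic_part


def contrail_factor_air_frame(fuel_flow, air_flow, cp_exit, t_exit_static,
                              v_exhaust, v_aircraft, cp_ambient, t_ambient):
    """Contrail factor in (g/kg)/K: water added over enthalpy added, times ambient Cp."""
    water_added = exhaust_mixing_ratio_gkg(fuel_flow, air_flow)
    enthalpy_added = (exhaust_total_enthalpy(cp_exit, t_exit_static,
                                             v_exhaust, v_aircraft)
                      - cp_ambient * t_ambient)
    if enthalpy_added <= 0.0:
        raise ValueError("exhaust enthalpy must exceed ambient enthalpy")
    return water_added / enthalpy_added * cp_ambient


# Hypothetical engine state; only the aircraft speed differs between the two cases.
engine = dict(fuel_flow=0.5, air_flow=25.0, cp_exit=1100.0, t_exit_static=600.0,
              v_exhaust=500.0, cp_ambient=1004.0, t_ambient=220.0)
cf_slow = contrail_factor_air_frame(v_aircraft=0.0, **engine)
cf_fast = contrail_factor_air_frame(v_aircraft=240.0, **engine)
print(f"aircraft at rest:    {cf_slow:.4f} (g/kg)/K")
print(f"aircraft at 240 m/s: {cf_fast:.4f} (g/kg)/K")
```

With the engine state held fixed, a higher aircraft speed lowers the kinetic term in (3) and raises the contrail factor, so in this formulation the contrail factor depends on airspeed.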

Saatzer  (1995a),  equation  6.5,  gives  a  definition  of  exhaust  total  enthalpy  equivalent  to  equation  (3),  above. 

However,  in  his  subsequent  calculations,  e.g.  equation  6.7  et  seq,  he  appears  to  use  total  temperature  in  the  aircraft 
frame  of  reference  as  the  basis  for  estimating  total  enthalpy  in  the  exhaust.  This  provides  an  overestimate  of  the  total 
enthalpy  in  the  exhaust  in  the  frame  of  reference  of  the  air  through  which  the  aircraft  is  moving,  and  yields  an 
underestimate  of  the  contrail  factor,  and  more  importantly,  a  contrail  factor  that  is  independent  of  airspeed. 

In  the  reference  frame  of  the  air,  there  needs  to  be  no  additional  correction  for  propulsion  efficiency,  as  is  employed 
by  Busen  and  Schumann  (1995),  Jensen  et  al  (1998),  and  others.  In  the  frame  of  the  engine,  as  used  by  these 
workers,  exhaust  speed  relative  to  the  engine  plus  static  temperature,  or  equivalently  the  total  temperature 
(abbreviated  as  EGT,  exhaust  gas  temperature)  determines  the  total  enthalpy  of  the  exhaust  at  the  exit  plane.  A 
separate  calculation  of  propulsion  efficiency  (see  Schumann,  1996,  or  Detwiler,  1996,  for  a  review)  is  required  to 
correct  for  energy  not  in  the  exhaust  plume  of  a  moving  engine.  In  the  reference  frame  of  the  air  through  which  the 
engine  moves,  no  correction  for  propulsion  efficiency  is  required.  Note  that  in  (3),  as  the  aircraft  speed  more  nearly 
approaches  the  exhaust  speed  (as  it  does  for  high-bypass  ratio  turbofan  applications,  and  also  for  low-bypass  engines 
at  supersonic  flight  conditions),  the  enthalpy  of  the  exhaust  as  seen  in  the  reference  frame  of  the  environment  through 
which  the  engine  is  moving,  also  decreases.  In  the  unattainable  limiting  steady  state  where  the  exhaust  is  moving  aft 
at  the  same  speed  the  aircraft  is  moving  forward,  the  thrust  disappears,  the  propulsion  efficiency  becomes  unity  and 
the  contrail  factor  becomes  infinite. 

As will be shown below, contrail factors estimated using this approach applied to calculated engine cycles are in
reasonable  agreement  with  contrail  factors  inferred  from  observations  of  contrail  formation  behind  aircraft  in  steady 
flight.  All  of  the  results  presented  apply  to  steady-state  conditions.  Effects  of  transient  conditions,  such  as  the 
decelerations/accelerations  discussed  in  Saatzer  (1995a),  are  not  accounted  for  in  these  calculations. 
REVISED CONTRAIL FACTORS for LOW- and HIGH-BYPASS TURBOFAN ENGINES


Engine  cycle  calculations  were  performed  with  ONX/OFFX,  a  gas  turbine  engine  cycle  computation  package  that  is 
a  companion  to  Mattingly  et  al  (1987).  A  variety  of  engine  cycles  were  explored,  with  an  emphasis  on  low-bypass 
and  high-bypass  turbofan  engines  based  on  examples  given  in  Mattingly  et  al  (1987).  Both  on-design  and  off-design 
operating  points  were  computed.  A  particular  set  of  initial  parameters,  including  compressor  pressure  ratio  and 
bypass ratio, was chosen, and then iterations were performed with ONX to find the engine parameters that optimize specific fuel
consumption for a particular flight condition. Once the engine parameters were established for the
design  condition,  OFFX  was  used  to  look  at  performance  characteristics  of  this  engine  at  other  flight  conditions. 

Results  of  engine  cycle  computations  also  were  obtained  from  Aeronautical  Systems  Center  (ASC)  for  a  typical  low- 
bypass  and  a  typical  high-bypass  engine.  These  results  are  more  accurate  than  those  obtained  using  ONX/OFFX 
because  the  ASC  computations  involve  fewer  simplifications. 

Three  estimates  of  a  contrail  factor,  the  ratio  of  water  to  enthalpy  added  to  the  air  passing  through  the  engine,  are 
presented  below  for  low-bypass  and  high-bypass  engines  for  several  flight  conditions. 


1. The first method follows the Appleman method. Here, it is assumed that 1.28 lbm of water and 18400 BTU are
produced as a result of the combustion of 1 lbm of fuel, following Jiusto and Pilie (1958). In the standard units
used  by  Appleman  and  followed  by  others  since,  the  contrail  factor  if  all  of  this  water  and  enthalpy  were  in  the 
exhaust  would  be  0.030  (g/kg)/C,  independent  of  engine  setting,  flight  condition,  or  any  other  variable 
associated  with  aircraft  operation.  For  the  comparisons  to  follow,  the  same  assumption  about  water  production 
is  made  for  each  method,  so  the  relative  comparisons  would  not  change  if  a  different  ratio  of  water  to  fuel  were 
assumed.  The  contrail  factor  computed  using  the  traditional  Appleman  method  is  used  as  a  reference.  It  is 
impossible  for  the  exhaust  to  contain  more  energy  than  was  generated  by  combustion,  although  due  to  parasitic 
flows,  power  extraction  for  accessories,  etc.,  the  energy  in  the  exhaust  could  certainly  be  less  than  that  generated 
by combustion. If the water-generated to fuel-combusted ratio is constant, and the energy in the exhaust must be
less  than  or  equal  to  the  energy-to-fuel  combusted  ratio,  then  the  contrail  factor  of  an  engine  in  operation  should 
never  be  less  than  this  “Appleman”  factor,  although  it  could  be  higher. 

2.  A  second  estimate  is  derived  making  the  same  assumption  about  water  production,  but  estimating  the  enthalpy 
added  to  the  air  passing  through  the  engine  by  taking  the  difference  between  the  total  enthalpy  at  station  9  (the 
exhaust  plane)  based  on  the  cycle  computation  of  the  stagnation  temperature  there,  and  the  static  enthalpy  in  the 
environment.  In  this  method,  the  exit-plane  enthalpy  is  the  enthalpy  of  the  exhaust  in  the  frame  of  reference 
moving  with  the  engine. 


3.  The  third  method  assumes  the  same  relationship  between  fuel  consumption  and  water  production,  but  the 
enthalpy  change  in  the  air  flowing  through  the  engine  is  taken  to  be  the  total  enthalpy  in  a  reference  frame 
relative  to  the  air  through  which  the  engine  moves,  minus  the  static  enthalpy  in  the  environment. 
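
The three estimates can be compared numerically. The sketch below is illustrative only: it assumes a constant specific heat for air (1004 J/(kg K)), and the fuel fraction, exit-plane static temperature, exhaust speed, flight speed and ambient temperature are hypothetical values chosen for the example, not output from ONX/OFFX or the ASC computations. The function and variable names are likewise assumptions.

```python
CP_AIR = 1004.0                    # J/(kg K); assumed constant specific heat of air
J_PER_KG_PER_BTU_PER_LBM = 2326.0  # unit conversion: 1 BTU/lbm = 2326 J/kg
WATER_PER_FUEL = 1.28              # kg water per kg fuel (value used in the text)
HEAT_PER_FUEL = 18400.0 * J_PER_KG_PER_BTU_PER_LBM  # J per kg fuel


def contrail_factor(water_g_per_kg, enthalpy_j_per_kg):
    """Water added (g/kg) divided by enthalpy added, expressed as a temperature rise (C)."""
    return water_g_per_kg / (enthalpy_j_per_kg / CP_AIR)


def method1():
    """Appleman reference: all combustion water and heat appear in the exhaust."""
    return contrail_factor(1000.0 * WATER_PER_FUEL, HEAT_PER_FUEL)


def method2(fuel_fraction, t_exit, v_exit, t_amb):
    """Total enthalpy at the exit plane in the frame moving with the engine."""
    delta_h = CP_AIR * (t_exit - t_amb) + 0.5 * v_exit ** 2
    return contrail_factor(1000.0 * WATER_PER_FUEL * fuel_fraction, delta_h)


def method3(fuel_fraction, t_exit, v_exit, v_flight, t_amb):
    """Total enthalpy in the frame of the ambient air (exhaust speed is v_exit - v_flight)."""
    delta_h = CP_AIR * (t_exit - t_amb) + 0.5 * (v_exit - v_flight) ** 2
    return contrail_factor(1000.0 * WATER_PER_FUEL * fuel_fraction, delta_h)


# Hypothetical mixed-exhaust state: kg fuel per kg exhaust, K, m/s, K
case = dict(fuel_fraction=0.02, t_exit=931.0, v_exit=600.0, t_amb=229.0)
cf1 = method1()
cf2 = method2(**case)
cf3 = method3(v_flight=240.0, **case)
print(f"Method 1: {cf1:.4f}  Method 2: {cf2:.4f}  Method 3: {cf3:.4f} (g/kg)/C")
```

With these illustrative inputs, Method 2 falls below the Method 1 reference value and Method 3 lies above it, which is the ordering described in the text.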


Results for the low-bypass engine are computed assuming complete mixing of the core and duct air flows prior to the
exit  plane.  The  situation  is  more  complex  for  the  high-bypass  engine  cycles.  In  some  designs,  the  bypass  and  core 
streams  are  mixed  prior  to  leaving  the  nozzle.  In  others,  the  fan  stream  exits  upstream  and  concentric  to  the  core 
stream.  For  this  reason,  contrail  factors  for  the  generic  high-bypass  engine  are  computed  by  both  methods  2  and  3, 
for  two  extreme  situations.  The  first  (a)  assumes  that  core  air  is  mixing  in  an  undiluted  form  with  ambient  air.  The 
second  (b)  assumes  that  core  and  duct  air  are  completely  mixed  prior  to  contrail  initiation,  and  that  this  mixture  has  a 
mass-weighted  average  speed.  In  actuality,  some,  but  not  complete,  mixing  between  core  and  duct  air  probably 
occurs  in  most  cases  prior  to  contrail  initiation,  and  the  observed  contrail  factor  probably  would  lie  between  these 
two  extremes  in  most  installations. 
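
The two limiting cases for the high-bypass engine can be illustrated as follows. This is a minimal sketch, assuming equal and constant specific heats for both streams and mass-weighted averaging of total temperature, speed and water content. The stream values, the bypass ratio and all names are hypothetical and are not taken from the engine cycle data used in this study.

```python
CP_AIR = 1004.0  # J/(kg K); assumed constant and equal for both streams


def mix(core, duct, bypass_ratio):
    """Mass-weighted average of each property; bypass_ratio = duct mass flow / core mass flow."""
    return {key: (core[key] + bypass_ratio * duct[key]) / (1.0 + bypass_ratio)
            for key in core}


def air_frame_contrail_factor(stream, v_flight, t_amb):
    """Contrail factor, (g/kg)/C, using total enthalpy in the frame of the ambient air."""
    # Static enthalpy excess: engine-frame total enthalpy minus engine-frame kinetic energy
    static_excess = CP_AIR * (stream["t_total"] - t_amb) - 0.5 * stream["speed"] ** 2
    # Add back the kinetic energy as seen from the ambient air
    delta_h = static_excess + 0.5 * (stream["speed"] - v_flight) ** 2
    return stream["water"] * CP_AIR / delta_h


# Hypothetical streams: engine-frame total temperature (K), speed (m/s), water (g/kg)
core = {"t_total": 800.0, "speed": 450.0, "water": 32.0}
duct = {"t_total": 290.0, "speed": 300.0, "water": 0.0}
mixed = mix(core, duct, bypass_ratio=5.0)

cf_core = air_frame_contrail_factor(core, v_flight=240.0, t_amb=229.0)    # case (a)
cf_mixed = air_frame_contrail_factor(mixed, v_flight=240.0, t_amb=229.0)  # case (b)
print(f"(a) core only: {cf_core:.3f}  (b) fully mixed: {cf_mixed:.3f} (g/kg)/C")
```

For these inputs the fully mixed value is lower than the core-only value; an installation with partial mixing would be expected to fall between the two.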

Examples  of  results  computed  for  a  low-bypass  engine  over  a  range  of  flight  conditions  at  30  kft  altitude  in  a 
standard  atmosphere  are  shown  in  Figures  4  and  5.  The  contrail  factor  in  Figure  4  is  computed  in  the  aircraft  frame 
of  reference  (Method  2)  while  Figure  5  shows  results  computed  in  a  frame  of  reference  moving  with  the  ambient  air 
(Method  3). 

In the aircraft frame of reference, Figure 4 shows that the contrail factor decreases with increasing flight Mach
number,  and  generally  increases  with  increasing  power  setting  (PLA)  at  all  flight  Mach  numbers  except  M  =  0.4.  At 
M  =  0.4  and  M  =  0.5  there  is  very  little  dependence  of  contrail  factor  on  power  setting.  The  total  range  of  variation 
is  about  +/-7%  from  the  mid-range  of  contrail  factor  values  over  the  range  of  flight  conditions  examined.  All 
computed  factors  are  less  than  the  Appleman  factor  estimated  using  Method  1.  This  relationship  shows  that  these 
computations  are  incorrect.  A  contrail  factor  lower  than  that  computed  from  fuel  properties  can  only  arise  if  there  is 
higher  enthalpy  per  unit  of  fuel  consumed  compared  to  the  theoretical  heating  value.  While  it  is  possible  for 
enthalpy  to  be  lost  from  the  engine  air  flow  in  parasitic  flows  (bleed  air  extraction,  accessory  power  take-off,  etc.) 
there is no way to add enthalpy to the flow beyond that released by fuel combustion.

Figure 4. Contrail factors computed in the aircraft frame-of-reference for a low-bypass engine for 4 power settings
(PLA). The engine is operating at 30000 ft MSL in a standard atmosphere. Results are shown for flight Mach
numbers  ranging  from  0.4  to  0.9,  at  each  power  setting. 


Figure  5  shows  that  contrail  factors  are  generally  higher  when  computed  in  the  frame  of  reference  of  the  air  through 
which  the  aircraft  is  moving.  None  of  the  contrail  factors  are  less  than  the  “Appleman”  contrail  factor  estimated 
using Method 1. The generally higher values arise because there is less total enthalpy in this reference frame, while
the water content is the same as estimated using Method 2. There is little variation in contrail factor for 50 < PLA <
87.5,  but  a  big  jump  in  contrail  factor  between  PLA  =  50  and  PLA  =  18.  These  “corrected”  contrail  factors 
generally  increase  with  increasing  flight  Mach  number  at  constant  PLA  because  the  exhaust  is  moving  more  slowly 
relative  to  the  air  as  the  aircraft  speeds  up.  In  other  words,  the  exit  plane  exhaust  speed  becomes  less  in  this 
reference  frame  as  Mach  number  increases,  lowering  the  total  enthalpy  in  the  exhaust.  At  a  given  Mach  number,  the 
“corrected” contrail factor increases with decreasing power setting for the same reason, that is, exhaust speed relative
to the air is decreasing as PLA decreases. The total range of variation of the “corrected” contrail factor is about
+/-25% over the range of conditions examined.


Figure 5. Results for contrail factor in a reference frame at rest relative to the air, in the same format as Figure 4
(panel title: Low-Bypass, 30 kft).


Figures  6  and  7  show  similar  information  to  that  given  in  Figures  4  and  5,  for  a  range  of  altitudes  between  25000  and 
50000  feet  in  a  standard  atmosphere.  The  range  of  variation  of  contrail  factor  over  the  range  of  Mach  numbers  and 
power  settings  examined,  diminishes  somewhat  with  altitude.  Overall  mass  flow  through  the  engine  diminishes  with 
increasing  altitude,  and  at  the  higher  altitudes  there  is  less  variability  in  engine  conditions  over  the  same  range  of 
power settings. However, other major trends are as shown in Figures 4 and 5. Again, it is seen that Method 2 (Figure
6)  yields  unrealistic  results  because  most  of  the  contrail  factors  are  less  than  the  “Appleman”  one,  while  Method  3 
(Figure 7) yields more plausible estimates of contrail factors in that all factors are equal to or greater than the
Appleman  one.  The  contrail  factors  at  higher  power  settings  are  in  the  range  estimated  in  Schrader  (1997),  based  on 
flight  test  observations  of  low-bypass  turbofan-equipped  aircraft  making  contrails.  They  are  lower  than  those 
estimated in Peters (1993).

Figure 6. As in Figure 4, but showing estimates for a similar range of power setting and flight Mach number over a
range  of  altitudes. 


The  situation  is  somewhat  more  complex  for  high-bypass  engines,  as  2  contrail  factors  can  be  computed  for  each 
reference  frame.  Figure  8  shows  results  for  contrail  factors  estimated  by  Method  2a  (aircraft  reference  frame, 
unmixed  core  air).  Contrail  factors  are  generally  a  factor  of  2  higher  than  for  the  low-bypass  engine.  This  is  due  to 
the  fact  that  all  of  the  water  generated  during  combustion  is  in  the  core  air,  but  a  significant  fraction  of  the  energy  is 
extracted  to  accelerate  the  duct  air.  Contrail  factor  decreases  with  increasing  power  setting  because  at  higher  power 
settings  proportionally  less  energy  is  extracted  to  the  duct  air  stream. 

Figure 7. As in Figure 5, but for a range of altitudes.

Figure 9 shows results for the “corrected” core contrail factor (Method 3a). These corrected contrail factors are
slightly higher than those shown in Figure 8, due to the lower total enthalpy in the exhaust in the frame of reference
of the air. The range of variation at a given power setting is smaller than for the “standard” contrail factor. It is not
possible to compare the estimates in Figures 8 and 9 to the fuel-property-based “Appleman” factor, because some
fraction of the enthalpy released by combustion is present in the fan stream. However, by mixing the fan and core
streams for the high-bypass engine, and computing a contrail factor for the mixed exhaust, we have a situation where
all  enthalpy  should  be  accounted  for  in  the  computation  of  the  contrail  factor.  This  mixing  is  used  to  construct  Figure 
10.  Figure  10  shows  standard  contrail  factors  estimated  assuming  that  core  and  duct  air  streams  are  completely 
mixed  prior  to  contrail  formation.  They  are  much  lower  than  the  contrail  factors  shown  in  Figure  8,  and  for  many 
conditions  shown  are  less  than  the  fuel-based  “Appleman”  factor.  Ideally,  the  mixed  stream  contrail  factors  in 
Figure  10  should  be  very  close  to  those  in  Figure  6  for  the  low-bypass  engine,  and  both  should  be  greater  than  or 
equal  to  the  “Appleman”  contrail  factor,  0.030  (g/kg)/°C.  The  fact  that  for  most  conditions  these  contrail  factors  are 
less than the Appleman factor suggests that they are computed incorrectly.

Figure 8. Contrail factors in the aircraft frame of reference estimated for the core stream of a high-bypass turbofan
engine.  Results  are  presented  for  6  altitudes,  three  power  settings  (RC)  and  flight  Mach  numbers  ranging  from  0.4 
to  0.9. 

Figure 11 shows that the “corrected” contrail factors for the mixed streams are higher than the “standard” contrail
factors for the mixed streams shown in Figure 10, but lower than the “corrected” contrail factors for the core streams
shown in Figure 9. Compared to the results in Figure 10, the contrail factors are higher in Figure 11 because the
total enthalpy in the exhaust is less in the frame of reference of the air. There is a stronger trend for decreasing
contrail factor with increasing power setting in Figure 11 compared to Figure 10 because in Figure 11 the exhaust
speed in the air frame of reference increases more dramatically with power setting.

Most importantly, none of the mixed-stream contrail factors, in the reference frame of the air, are less than the
fuel-based “Appleman” factor. The contrail factors shown in Figure 11 are generally lower than those estimated in Peters
(1993)  for  high-bypass  engines,  but  slightly  higher  than  those  given  in  Schrader  (1997)  as  being  deduced  from  flight 
tests.  This  suggests  that  the  engines  observed  in  these  flight  tests  probably  had  well-mixed  fan  and  core  streams  very 
close  to  the  exit  plane.  It  is  possible  that  the  assumption  used  above,  that  1.28  kg  of  water  are  generated  during  the 
combustion of 1 kg of fuel, is a bit optimistic. A value of 1.22, as suggested in Busen and Schumann (1995), would
lower the contrail factors in Figure 11 to a range closer to that suggested in Schrader (1997) for high-bypass engines.

Figure 10. Contrail factors for the high-bypass engine computed in the reference frame of the engine, assuming fan
and  core  streams  are  completely  mixed  prior  to  contrail  formation. 


DIAGNOSIS  of  CONTRAIL  FACTORS  from  PHILLIPS  LABORATORY  CONTRAIL  OBSERVATIONS 

During  the  fall  of  1995,  the  Air  Force  Phillips  Laboratory  (now  Air  Force  Research  Laboratory  at  Hanscom  AFB) 
undertook an intensive observation campaign (Jackson et al., 1997). Radiosondes were launched from 5 sites in the
Boston  area  at  intervals  as  small  as  3  hours.  Observers  at  these  5  sites  observed  aircraft  and  whether  or  not  they 
were  making  contrails.  Confirmation  was  obtained  in  real  time  from  a  nearby  Federal  Aviation  Administration 
(FAA)  en-route  traffic  control  center  of  the  altitude  and  aircraft  type  for  each  sighting  by  the  observers. 

There were 563 observations of aircraft in flight made in a 2-week period. A summary of these data is shown in
Figure 12. From this set, a subset of ~100 observations was extracted in which visual descriptions of the contrail
observed were recorded. It was examined for short-lived contrails. Contrails that are short-lived (last less than a few
seconds) are short-lived both because the air in which they form is dry and subsaturated with respect to liquid (and
probably subsaturated with respect to ice), and because the conditions in which they are forming are at the very
threshold for contrail formation. In the context of Figure 3, the mixing line is just tangent to the saturation mixing
ratio curve and barely crosses above it. A group of 59 observations of short-lived contrails was selected for further
analysis.

Figure 11. As in Figure 10, but for the contrail factor computed in the reference frame of the air through which the
aircraft is moving.

Environmental  conditions  in  which  short-lived  contrails  were  observed  were  interpolated  from  the  dense  network  of 
soundings  to  the  location  of  the  contrail  observation.  Looking  at  Figure  3,  if  the  environmental  condition  is 
specified,  and  a  line  is  drawn  from  it  that  is  just  tangent  to  the  saturation  mixing  ratio  curve  in  Figure  3,  then  the 
slope  is  determined  for  the  mixing  line  describing  the  mixing  that  produced  this  non-persisting  contrail.  This  slope  is 
the contrail factor. This represents a lower limit: the actual contrail factor could be somewhat higher because, if the
ambient air were dry enough, the plume could mix out to a subsaturated state quickly and yield a short-lived contrail
even if the mixing line passed markedly into the saturated region.

Figure 12. Aircraft observations as a function of ambient temperature and pressure.

From  the  aircraft  type,  as  obtained  from  the  FAA,  an  estimate  was  made  of  the  bypass  ratio  of  the  engines  on  the 
observed  aircraft.  This  estimate  was  made  using  Jane’s  All  the  World’s  Aircraft  (Lambert,  1991).  Many  aircraft 
types  may  have  different  engines  with  a  range  of  bypass  ratios.  In  cases  where  more  than  one  bypass  ratio  applied  to 
the  various  engines  in  common  use  on  aircraft  of  a  given  type,  a  mean  bypass  ratio  was  assigned  to  all  aircraft  of  that 
type. 

Contrail  factors  were  inferred  by  a  numerical  iteration  procedure  analogous  to  fitting  a  line  to  the  two  known  points 
on  Figure  3,  for  each  short-lived  contrail  observed  during  the  study.  The  contrail  factor  estimates  are  summarized  in 
Figure 13.

Figure 13. Inferred contrail factors based on 59 observations of short-lived contrails. The temperatures (°C) at
which  the  contrails  were  observed  are  plotted  as  a  function  of  the  deduced  contrail  factor  and  the  estimated  bypass 
ratio. 
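
The tangent construction used to infer a threshold contrail factor can be sketched numerically. The example below is not the iteration procedure used in the study. It assumes the Bolton (1980) approximation for saturation vapor pressure over liquid water, extrapolated to low temperatures, and finds the tangent slope by a simple scan over plume temperatures. The function names and the sample ambient conditions are hypothetical.

```python
import math


def saturation_mixing_ratio(temp_c, pressure_hpa):
    """Saturation mixing ratio over liquid water in g/kg (Bolton 1980 approximation, assumed)."""
    e_s = 6.112 * math.exp(17.67 * temp_c / (temp_c + 243.5))  # hPa
    return 622.0 * e_s / (pressure_hpa - e_s)


def threshold_contrail_factor(temp_c, pressure_hpa, rel_humidity,
                              max_rise_c=60.0, step_c=0.01):
    """Slope, (g/kg)/C, of the line from the ambient point that is tangent to the saturation curve.

    Because the saturation curve is convex, the tangent slope is the smallest
    slope of any straight line joining the ambient point to a point on the curve.
    """
    w_env = rel_humidity * saturation_mixing_ratio(temp_c, pressure_hpa)
    best = math.inf
    for i in range(1, int(max_rise_c / step_c) + 1):
        t = temp_c + i * step_c
        slope = (saturation_mixing_ratio(t, pressure_hpa) - w_env) / (t - temp_c)
        best = min(best, slope)
    return best


# Hypothetical ambient conditions: -50 C, 250 hPa, two assumed humidities
dry = threshold_contrail_factor(-50.0, 250.0, rel_humidity=0.0)
moist = threshold_contrail_factor(-50.0, 250.0, rel_humidity=0.5)
print(f"dry: {dry:.3f}  moist: {moist:.3f} (g/kg)/C")
```

A lower assumed humidity gives a larger inferred factor, so humidity errors in the soundings feed directly into the inferred contrail factor.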

In  general,  the  range  of  inferred  contrail  factors  increases  with  increasing  estimated  bypass  ratio.  Most  observations 
yield  a  contrail  factor  between  0.030  and  0.10  (g/kg)/C,  consistent  with  estimates  of  contrail  factors  for  low  and 
high-bypass  gas  turbine  engines  in  a  reference  frame  moving  with  the  air,  given  above.  Some  inferred  contrail 
factors  are  less  than  0.030  (g/kg)/C.  However,  the  estimating  procedure  yields  only  a  lower  bound  on  the  contrail 
factor,  and  so  the  actual  factor  in  these  cases  could  have  been  greater  than  the  inferred  one.  Figure  14  shows  only 
those  observations  yielding  contrail  factors  less  than  0.10  (g/kg)/C  in  order  to  make  the  lower  contrail  factor 
distribution more discernible. The range of variation of these observations agrees very well with the theory for
contrail factor calculation in the frame of reference moving with the air, as developed above.

Figure 14. The region of Figure 13 corresponding to contrail factors less than 0.10 (g/kg)/C.


Five  contrails  in  Figure  13  were  observed  at  temperatures  higher  than  -40  C.  These  observations  yield  inferred 
contrail factors much greater than 0.10 (g/kg)/C. They all were produced by aircraft with high-bypass engines. These
factors  are  not  consistent  with  steady-state  engine  operation  according  to  the  generic  high-bypass  engine  cycle  data 
used  in  this  study.  It  is  possible  they  were  produced  by  aircraft  in  non-steady-state  flight  or  at  a  low  power  setting 
during  a  slow  descent  where  the  exhaust  gas  velocity  was  even  closer  to  the  forward  airspeed  than  for  the  low  power 
setting  engine  cycle  data  used  in  this  study.  A  low  throttle-setting  and  low  exhaust  speed  could  lead  to  a  very  low 
total  enthalpy  in  the  exhaust  in  the  reference  frame  of  the  air,  leading  to  very  high  contrail  factors.  It  would  be  of 
interest  to  Air  Force  forecasters  to  study  situations  like  this,  as  current  contrail  forecasting  procedures  would  not 
recognize  the  possibility  of  contrail  formation  at  such  high  temperatures,  no  matter  what  the  humidity. 

There  are  many  uncertainties  in  inferring  bypass  ratios,  and  there  are  additional  uncertainties  in  inferring  contrail 
factor caused by uncertainties in the temperature, pressure, and most importantly humidity, in the contrail
environment. Humidities from radiosonde instrument packages at low temperatures are characteristically
underestimated.  An  underestimate  of  humidity  would  yield  an  overestimate  of  contrail  factor  (see  Figure  3). 

Given  these  uncertainties,  the  bulk  of  the  data  presented  in  Figures  13  and  14  support  the  theory  of  computing  the 
contrail  factor  using  exhaust  enthalpy  in  the  reference  frame  of  the  air.  Contrail  factors  computed  in  the  aircraft 
frame  of  reference  are  generally  lower  than  those  estimated  from  the  observational  data  set.  Contrail  factors 
computed  in  the  reference  frame  of  the  air  through  which  the  aircraft  is  moving  are  generally  higher,  more  variable, 
and  closer  to  the  range  of  those  inferred  from  observations.  The  5  observations  that  yield  factors  that  exceed  the 
contrail  factors  characteristic  of  high-bypass  gas  engines  at  the  flight  conditions  investigated  must  have  been 
characterized  by  a  flight  environment  and/or  flight  settings  outside  the  range  of  these  conditions. 


SUMMARY  and  CONCLUSIONS 

A physically consistent procedure for computing contrail factors based on engine cycle calculations has been
presented.  It  follows  procedures  used  in  past  studies,  with  the  exception  that  care  is  taken  in  the  estimation  of 
exhaust  gas  total  enthalpy.  The  most  important  consideration  is  that  the  total  enthalpy  of  the  exhaust  must  be 
computed  in  the  frame  of  reference  of  the  air  through  which  the  engine  is  being  carried  by  the  aircraft.  Contrail 
factors  computed  in  this  way  are  equal  to  or  greater  than  the  contrail  factor  of  0.030  (g/kg)/C  estimated  assuming  all 
water  and  enthalpy  generated  during  combustion  exits  the  tail  pipe.  They  can  be  greater  than  this  because  some  of 
the  energy  generated  by  combustion  is  used  to  propel  the  aircraft,  and  some  is  lost  in  parasitic  flows.  In  contrast, 
contrail factors estimated assuming the EGT represents the total enthalpy in the exhaust are too
low, particularly when the exhaust speed is close to the airspeed.

When  exhaust  gases  have  high  exit  speeds  relative  to  the  airspeed  of  the  aircraft,  fan  and  core  streams  are  well- 
mixed,  and  there  is  little  energy  extracted  in  the  form  of  parasitic  flows,  the  contrail  factor  should  approach  0.030 
(g/kg)/°C.  For  high-bypass  engines,  which  move  more  air  more  slowly  than  low-bypass  engines,  contrail  factors 
should be generally higher.

It  is  seen  that,  contrary  to  assumptions  made  in  current  methods  of  contrail  forecasting,  the  contrail  factor  is  not 
constant  for  a  given  engine  type  for  all  flight  conditions.  It  varies  even  for  the  same  engine  at  different  power 
settings  and  flight  conditions.  Cycle  calculations  were  done  for  a  range  of  conditions  for  a  low-bypass  and  a  high- 
bypass  engine.  This  range  included  flight  at  altitudes  between  25000  and  50000  ft  in  a  standard  atmosphere,  Mach 
numbers  ranging  from  0.4  to  0.9,  and  power  settings  from  idle  to  military.  The  contrail  factors  range  from  0.030  to 
0.053 (g/kg)/°C for low-bypass engines, and from 0.038 to 0.090 (g/kg)/°C for high-bypass engines, over this range
of conditions. Contrail factors are generally higher at lower power settings at a given Mach number, and
higher  at  higher  Mach  numbers  at  a  given  power  setting.  Contrail  factors  tend  to  be  higher  at  higher  altitudes  in 
general.  Finally,  contrail  factors  are  generally  higher  for  high-bypass  ratio  engines  than  for  low-bypass  engines. 

Contrail  factors  are  diagnosed  from  a  detailed  set  of  observations  of  contrails  forming  at  threshold  conditions  for 
formation.  These  contrail  factors  are  compared  to  those  computed  from  cycle  calculations.  Generally  good 
agreement  is  found  between  the  range  of  contrail  factors  inferred  from  observations  and  those  estimated  from  cycle 
calculations. A few observations of contrails forming at relatively high temperatures appear to correspond to engine
cycles or flight conditions outside the range of those used in this study.

MEASUREMENTS OF ION-MOLECULE
REACTIONS AT VERY HIGH TEMPERATURES

Itzhak Dotan

Abstract

We have developed a flowing-afterglow apparatus for the study of
ion-molecule reactions at high temperatures (HTFA). Using this apparatus we have
studied the rate constants for eight reactions over the temperature range 300-1800 K.
Comparing to previous drift tube measurements allows the rate constants for
vibrationally excited neutrals to be derived.


Introduction 

The  need  for  understanding  the  chemical  behavior  of  plasmas  has  spawned  the 
development  of  techniques  for  studying  ion-molecule  reactions  at  elevated 
temperatures.  High  temperature  ion  chemistry  occurs  in  the  ionosphere,  in  combustion 
situations,  in  plasma  chemical  reactors,  and  in  plasmas  associated  with  atmospheric 
reentry and hypersonic flight. In the case of atmospheric reentry, for example, a plasma may
envelop the vehicle and interfere with radio communication. Computer models are
used  to  describe  the  complicated  chemistry  in  these  plasma  situations,  such  as  reentry, 
but  the  models  require  accurate  and  meaningful  kinetics  data  in  order  to  describe  and 
predict  the  plasma  properties.  While  there  is  a  wealth  of  ion  chemical  kinetics  data 
available  in  the  literature,  very  little  covers  temperatures  above  300  K,  and  such  data 
above  900  K  are  essentially  nonexistent.  High  temperature  kinetics  data  are  often 
required  for  modeling  plasmas  at  elevated  temperatures  because  extrapolation  of  low 
temperature  data  is  unreliable. 

One  way  to  circumvent  the  temperature  limitation  has  been  to  study  reactions  at 
elevated  ion  energy  in  drift  tubes  and  ion  beam  apparatuses.  These  techniques  have 
been  highly  successful  and  produced  an  abundance  of  data  on  many  systems. 
However,  results  from  these  techniques  offer  only  a  guide  for  predictions  on  the 
behavior  of  reaction  rate  constants  and  product  branching  fractions  at  elevated 
temperatures  because  it  is  the  translational  energy  (and  perhaps  the  internal  energy  of 
the  ion)  that  is  raised  in  ion  beam  and  drift  tube  experiments.  It  has  been  shown  that 
temperature  and  kinetic  energy  often  have  differing  effects  on  reactivity.  We  have 
studied  many  systems  as  functions  of  both  temperature  and  kinetic  energy  and  have 
observed  a  variety  of  behaviors,  e.g.,  sometimes  increasing  temperature  increases 
reactivity  while  kinetic  energy  decreases  reactivity.  In  other  cases,  increasing 
temperature  decreases  reactivity  more  effectively  than  does  kinetic  energy.  Thus  in 
some cases kinetic energy dependences may be misleading indicators of temperature
dependence, and clearly there is a need for measurements at true temperatures.

While the kinetics of most ion-molecule reactions have been measured only at room
temperature,  there  has  been  a  considerable  amount  of  work  in  studying  temperature 
dependence  of  ion-molecule  reactions  over  the  range  between  80-600  K.  These 
temperature  dependence  studies  have  shown  interesting  features  including  the 
important  observation  that  rate  constants  for  ion-molecule  reactions  frequently 
decrease  with  increasing  temperature,  presumably  because  the  lifetime  of  the  reaction 
complex  decreases  with  collision  energy. 

There  have  been  only  two  studies  where  rate  constants  of  ion-molecule  reactions  were 
measured  above  600  K.  The  first  set  of  measurements  was  made  at  the  NOAA 
Aeronomy  Laboratory  in  1974  using  a  flowing  afterglow  apparatus  designed  to 
operate  over  the  temperature  range  80-900  K.  A  total  of  9  reactions  were  studied  up  to 
900  K,  all  involving  simple  systems.  The  second  experiment  was  carried  out  at  the 
University  of  Pittsburgh  in  a  static  drift  tube  designed  to  operate  up  to  1500  K. 
Measurements  were  made  up  to  930  K  on  two  reactions  that  had  been  previously 
studied  by  the  NOAA  group.  Measurements  were  not  made  above  930  K  due  to 
thermionic  emission  of  alkali  ions  from  the  drift  tube  walls. 

Recently, in the Phillips Laboratory we have constructed a flowing afterglow to study
ion-molecule reactions at temperatures from 300 to 1800 K. In this report I describe
the experimental setup and the results for 8 reactions studied as a function of
temperature up to 1800 K.

The following reactions were studied:

Experimental

The  measurements  were  made  in  a  high  temperature  flowing  afterglow.  This  apparatus 
has  been  described  in  detail  recently  [4]  and  just  a  brief  description  is  given  here.  Ions 
are  created  by  electron  impact  in  the  upstream,  cooled  section  of  a  ceramic  flow  tube. 
A  helium  buffer  carries  the  ions  downstream  where  the  flow  tube  is  heated  by  a 
commercial  furnace.  After  the  gases  have  reached  the  equilibrium  temperature,  the 
reactant gas is added. The bulk of the gas is pumped by a Roots blower; a small
fraction is sampled, and the ions are analyzed by a quadrupole mass filter and detected
by an electron multiplier. The decay of the primary ion signal as a function of the reactant
neutral  gas  concentration  yields  the  rate  constant.  The  only  significant  change  in  the 
instrument  since  the  detailed  description  was  published  has  been  to  slightly  lengthen 
the  flow  tube  upstream  of  the  furnace.  This  allowed  for  measurements  up  to  1800  K 
by  increasing  the  distance  between  the  hot  zone  and  the  sealing  gaskets  and  o-rings. 
Two  minor  changes  included  the  use  of  a  turbopump  on  the  ion  lens  region  and  a 
separate  pump  for  the  vacuum  box.  The  system  is  described  in  Fig.  1. 

All the ions were made from their parent neutral gas. Sufficient gas was added to
ensure that the production of ions was complete before the reactant inlet. Insufficient
gas resulted in curvature in the data, indicating a problem. The helium was passed
through a liquid nitrogen trap to reduce the water vapor concentration. The reactant
neutrals, except for NO, were used without further purification. NO was passed
through a silica gel trap at 170 K to reduce contamination. No attempt was made to
measure the product ions except to confirm the appearance of products known from other
results. We estimate the total and relative errors in the measurements as ±25% and
±15%, respectively.

Results and Discussion


N+ + O2

Figure 2 shows the rate constants for the reaction of N+ with O2 as a function of
temperature. The rate constants increase with temperature from 300 K to 1000 K,
above  which  they  level  out  at  the  collision  rate  within  our  error  limits,  the  measured 
rate  constants  being  about  10%  higher  than  the  collisional  value.  It  was  not  possible  to 
determine  the  products  of  the  reaction  but  other  studies  [5,6]  have  indicated  that 
branching  is  43%,  51%,  and  6%  into  the  3  channels,  respectively. 

The  rate  constant  for  this  reaction  has  been  measured  in  a  number  of  studies.  [2,  5-10] 
Of  particular  interest  to  the  present  study  is  the  NOAA  study  of  the  temperature 
dependence  up  to  900  K  [2]  and  several  drift  tube  studies  [5,  9,  10]  of  the  rate 
constant as a function of average kinetic energy. In Fig. 2, we plot the NOAA data as a
function of temperature [2] and kinetic energy. [10] The drift tube collision energies are
converted to an effective temperature by the relation
$\tfrac{3}{2} k T_{\mathrm{translation}} = KE_{\mathrm{center\,of\,mass}}$;
$T_{\mathrm{translation}}$ is used to plot the data.
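
The conversion between center-of-mass kinetic energy and effective translational temperature can be sketched as follows. This is only a minimal illustration of the relation (3/2)kT = KE; the function names and the example energies are assumptions chosen for illustration and are not values taken from the measurements.

```python
# Boltzmann constant in eV/K (CODATA value).
K_B_EV = 8.617333262e-5


def effective_temperature(ke_cm_ev):
    """Effective translational temperature (K) from (3/2) k T = KE_cm."""
    return 2.0 * ke_cm_ev / (3.0 * K_B_EV)


def mean_kinetic_energy(temperature_k):
    """Inverse relation: average kinetic energy (eV) at temperature T."""
    return 1.5 * K_B_EV * temperature_k


if __name__ == "__main__":
    # Illustrative center-of-mass energies in eV (hypothetical inputs).
    for ke in (0.05, 0.1, 0.3):
        print(f"KE = {ke:.2f} eV -> T_eff = {effective_temperature(ke):.0f} K")
```

With this conversion, drift tube points taken at a given center-of-mass energy can be placed on the same temperature axis as the thermal data.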

The  NOAA  temperature  data  agree  with  the  present  results  within  our  15%  relative 
error.  Numerous  room  temperature  measurements  are  also  in  agreement  with  all  the 
above  mentioned  data.  [2,  5-10]  The  NOAA  drift  tube  data  plotted  in  Fig.  2  are  in 
good  agreement  with  the  temperature  dependence  data  at  low  field.  However,  the  drift 
tube  data  show  little  or  no  dependence  on  kinetic  energy  at  the  energies  of  interest  in 
this  study.  Between  1000  K  and  1600  K,  the  temperature  data  are  about  a  factor  of  2 
higher  than  the  drift  tube  data.  The  drift  tube  data  from  Johnsen  et  al.  [9]  are  very 
similar  to  the  NOAA  data  and  are  a  repeat  of  the  NOAA  experiment.  [5] 

Differences between rate constants measured as a function of temperature and as a
function of drift electric field reflect internal energy effects, since at a given
translational energy the translational contribution is the same in both experiments.
We have studied the internal energy dependences of numerous reactions using this
technique and found that differences as large as observed here are usually attributable
to vibrational excitation. However, even if we assume that all vibrationally excited O2
reacts at the Langevin rate, [11] there is not enough vibrational excitation in O2 to
account for the difference in the data. The curve labeled maximum in Fig. 2 assumes that
the v=0 rate constant is $5.5 \times 10^{-10}$ cm$^3$ s$^{-1}$ (the average of the drift tube data)
and that the v ≥ 1 rate constant is $1.1 \times 10^{-9}$ cm$^3$ s$^{-1}$ (the average of the present
data at temperatures over 1000 K, just slightly greater than the Langevin rate).
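
The construction of such an upper-limit curve can be sketched with a two-state model. The sketch below is an assumption-laden illustration, not the calculation used for Fig. 2: it treats O2 as a harmonic oscillator with an assumed spacing of 1556 cm^-1 (a literature value for the O2 fundamental, not given in this report) and takes the two rate constants quoted in the passage above as 5.5e-10 and 1.1e-9 cm^3 s^-1.

```python
import math

HC_OVER_K = 1.438777  # second radiation constant hc/k in cm*K


def excited_fraction(temperature_k, spacing_cm=1556.0):
    """Fraction of harmonic oscillators in v >= 1 at temperature T.

    spacing_cm is the vibrational spacing in cm^-1; 1556 cm^-1 is an
    assumed value for O2 and is not taken from the report.
    """
    return math.exp(-HC_OVER_K * spacing_cm / temperature_k)


def max_vibrational_rate(temperature_k, k_v0=5.5e-10, k_excited=1.1e-9):
    """Upper-limit thermal rate constant (cm^3 s^-1) for a two-state model.

    k_v0 is applied to O2(v=0) and k_excited to all O2(v>=1).
    """
    f = excited_fraction(temperature_k)
    return (1.0 - f) * k_v0 + f * k_excited


if __name__ == "__main__":
    for t in (300.0, 1000.0, 1400.0, 1800.0):
        print(t, excited_fraction(t), max_vibrational_rate(t))
```

Because the excited fraction stays well below one over this temperature range, the two-state curve remains much closer to the v=0 value than to the excited-state value, which is the sense in which vibrational excitation alone falls short.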
Since vibrations cannot fully account for the increase in reactivity, one or more of
several  possibilities  must  also  help  explain  the  reactivity  difference: 

1)  One  of  the  data  sets  may  be  in  error.  However,  two  temperature  dependent  studies 
and  three  drift  tube  studies  are  in  good  agreement.  In  addition,  comparison  of 
results  from  our  laboratory  and  the  NOAA  laboratory  almost  uniformly  shows 
excellent  agreement.  [4]  This  seems  to  indicate  the  effect  is  real. 

2)  Increasing  rotational  excitation  leads  to  larger  rate  constants.  The  large  rotational 
effects  we  have  observed  previously  have  fallen  into  two  categories:  endothermic 
reactions  where  rotational  and  translational  energy  are  equivalent  forms  of  energy 
or  reactions  involving  species  with  large  rotational  constants.  Neither  case  seems 
to  fit  the  present  situation.  There  are  some  product  states  of  reaction  1  that  are 
nearly thermoneutral and production of the diatomic product in v=1 would be

endothermic,  namely  ■*■)  +  and  .  [12]  O’Keefe 

et al. [13] have found no evidence of the former and that the latter channel
accounts for 15% of the total reactivity at 300 K. Howorka et al. [5] found that the
total channel producing O2+ increases with kinetic energy, indicating that the
O2+ + N(2D) channel may become more important with increasing energy.

Rotational  energy  may  then  be  efficient  at  driving  this  channel.  However, 
rotational and translational energy would not be equivalent forms of energy, in contrast to
what  we  have  observed  in  previous  studies  involving  endothermic  reactions. 

3) The populations of the spin-orbit states of N+ will vary more with temperature
than with kinetic energy, because in a drift tube they are governed by the kinetic
energy with respect to the buffer rather than with respect to the reactant. [14] For
the experiments shown in Fig. 2 the buffer is He and the kinetic energy with respect
to the buffer is considerably less than the kinetic energy with respect to the center
of mass of the reactants. The equilibrium populations of the $^3P_0$, $^3P_1$, and
$^3P_2$ states of N+ at 300 K are 17, 39, and 43%, respectively. At 1000 K the
populations are 13, 35, and 52%, respectively. These differences also cannot explain
all of the difference between the temperature and kinetic energy data.
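
The spin-orbit populations quoted in item 3 follow from a Boltzmann distribution over the three fine-structure levels. The sketch below is an illustrative check under stated assumptions: the level energies (0, 48.7, and 130.8 cm^-1) are literature values for the N+ ground term and are not given in this report, and the degeneracies are taken as 2J+1. It gives populations within roughly one to two percentage points of those quoted.

```python
import math

K_B_CM = 0.6950348  # Boltzmann constant in cm^-1 per K

# (label, J, energy in cm^-1). Energies are assumed literature values
# for the N+ 3P ground term; they do not come from the report.
N_PLUS_LEVELS = [("3P0", 0, 0.0), ("3P1", 1, 48.7), ("3P2", 2, 130.8)]


def spin_orbit_populations(temperature_k, levels=N_PLUS_LEVELS):
    """Equilibrium fractional populations with degeneracy 2J+1."""
    weights = {
        label: (2 * j + 1) * math.exp(-energy / (K_B_CM * temperature_k))
        for label, j, energy in levels
    }
    total = sum(weights.values())
    return {label: w / total for label, w in weights.items()}


if __name__ == "__main__":
    for t in (300.0, 1000.0):
        pops = spin_orbit_populations(t)
        print(t, {label: round(100 * p, 1) for label, p in pops.items()})
```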

Therefore,  we  are  led  to  conclude  that  the  observed  effect  is  due  to  a  complicated 
function  of  all  three  sources  of  internal  energy  which  is  difficult  to  unravel.  Another 
possibility  is  that  the  charge  transfer  channel  occurs  at  a  very  long  range  for  either 
vibrationally excited O2 or the higher level N+ spin states so that these rate constants
would be appreciably larger than the Langevin value.

N2+ + O2

Figure 3 shows the rate constants for the reaction of N2+ with O2 as a function of
temperature. The rate constants decrease with temperature up to about 1000 K and
increase above that. The reaction produces exclusively O2+. Numerous other studies
of  the  rate  constant  for  this  reaction  have  been  made.  In  Fig.  3  we  show  data  from  the 
NOAA  laboratory  as  a  function  of  temperature  [2]  from  300  to  900  K  and  as  a 
function  of  translational  temperature  measured  in  a  drift  tube.  [10]  Other 
measurements are similar. [12, 14, 15, 17, 20, 22, 40] The NOAA temperature
dependent  data  are  similar  to  the  present  data,  being  essentially  identical  at  lower 
temperatures  and  about  15%  larger  than  the  present  data  at  900  K,  well  within  our 
error  limits. 

At  low  temperatures  (<1000  K),  the  drift  tube  rate  constants  are  larger  than  either  set 
of  thermal  rate  constants  at  the  same  translational  temperature  but  show  the  same 
trend  (i.e.  decreasing  rate  constant  with  increasing  temperature)  as  the  thermal  data. 

Above  a  translational  temperature  of  1000  K,  the  drift  tube  data  continue  to  decrease 
until  a  minimum  is  reached  at  3000-4000  K,  in  contrast  to  the  increase  seen  for  the 
temperature  data  above  1000  K. 

We  attribute  the  disparity  at  low  temperatures  between  the  drift  tube  and  the  thermal 
rate  constants  at  a  given  translational  temperature  to  the  difference  in  reactant  internal 
energy  in  the  two  types  of  experiments.  Below  1000  K  the  internal  energy  is 
predominantly  the  rotational  energy,  vibrations  being  mainly  in  the  ground  states.  To 
elucidate the role of reactant rotational energy upon reactivity, we replotted the rate
constants from Fig. 3 as a function of the sum of average translational and rotational
energy in Fig. 4. For this, the thermal data are plotted vs. (7/2)kT and the drift tube
data are plotted versus the original kinetic energy plus 2kT (with T=300 K). As can be
seen in Fig. 4, this results in extremely good agreement at energies below 0.3 eV. This
agreement indicates that rotational and translational energy are equally detrimental to
reaction at low energies. We have seen these two types of energy being equally
important previously, although usually in the role of increasing the rates of
endothermic reactions. We have seen other examples where both rotational and
translational energy may have caused the rate constant to decrease, but the effect was
never as large as observed here, usually just at the maximum of our relative
uncertainty. Therefore, any conclusion we made was tentative.
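
The two energy axes used for this replotting can be sketched as follows. The sketch reads (7/2)kT as (3/2)kT of relative translation plus kT of rotation for each of the two diatomic reactants, and 2kT as the rotational energy of both reactants at the 300 K gas temperature of the drift tube; this decomposition is an interpretation and is not spelled out in the text. Function names are illustrative.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def thermal_total_energy(temperature_k):
    """Average translational + rotational energy (eV) for thermal data.

    (3/2) k T translation plus k T rotation for each of two diatomics,
    i.e. (7/2) k T in total.
    """
    return 3.5 * K_B_EV * temperature_k


def drift_tube_total_energy(ke_cm_ev, gas_temperature_k=300.0):
    """Center-of-mass kinetic energy plus 2 k T rotation at the gas temperature."""
    return ke_cm_ev + 2.0 * K_B_EV * gas_temperature_k


if __name__ == "__main__":
    print(thermal_total_energy(1000.0))   # thermal point at 1000 K
    print(drift_tube_total_energy(0.10))  # hypothetical drift tube point
```

On this scale a thermal measurement at 1000 K sits near 0.3 eV, which is the energy below which the two data sets are compared.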

A  qualitative  explanation  for  this  negative  dependence  of  the  rate  constants  upon 
reactant  energy  can  possibly  be  found  in  the  mechanism  suggested  by  Scherbarth  and 
Gerlich  [18]  to  account  for  the  similar  energy  dependence  observed  in  the  reaction  of 
Ar+ with O2. Briefly stated, this mechanism assumes that the reactants approach on an
attractive  potential  energy  surface  to  form  an  intermediate  complex  which  may  either 
separate  back  to  reactants  or  jump  at  a  crossing  seam  to  the  potential  energy  surface 
for  the  products.  If  the  latter  surface  is  repulsive,  the  probability  of  the  complex 
exiting  via  the  product  channel  would  increase  with  the  lifetime  of  the  complex  (i.e. 
the  number  of  times  the  trajectory  crosses  the  seam).  Since  increasing  either 
translational  or  rotational  energy  of  the  reactants  shortens  the  lifetime  of  the  complex, 
one would expect a decreasing fraction of collisions to lead to product formation as the
reactant energy (either translational or rotational) is increased.

From the data shown in Fig. 4, it is also possible to derive information on how
vibrational energy affects the reactivity. Several previous studies have shown that N2+
vibrations do not affect the rate constant for charge transfer [15, 19], presumably
because the overlap between states of N2+ and N2 of the same vibrational level is
good and the N2+ (v) reactant results in an N2 (v) product. Therefore, the difference
between the temperature and kinetic energy data above 0.3 eV is mainly due to
vibrational excitation of O2 in the HTFA experiment above 1000 K.

From  the  two  data  sets  and  calculated  O2  vibrational  populations  as  a  function  of 
temperature, it is possible to derive the rate constants for vibrationally excited O2.
The rate constant at a given temperature can be written as

$$k(T) = \sum_i k_i \, \mathrm{pop}(i)$$

where k(T) is the measured rate constant as a function of temperature, $k_i$ is the rate
constant for a particular vibrational state, and pop(i) is the population of that state. For
the reaction of N2+ with O2 we can derive vibrational energy effects in two ways. The
first is to assume that all vibrational excitation behaves similarly. For this case, we
take $k_0$ as the drift tube data, calculate the fraction of O2 that is vibrationally excited
from the known temperature, and solve for $k_{v \geq 1}$. The results are shown in Fig. 4 as
the O2 (v ≥ 1) rate. At the lowest energies the rate constants are found to rise relatively
rapidly with increasing energy, probably an artifact of using small differences between
the drift tube data and the present results to derive the rate constants. At higher
energies, the rate constants for O2 (v ≥ 1) are about a factor of 6 larger than those for
v=0 and increase slightly with increasing energy.
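
Solving the population-weighted sum for the excited-state rate constant can be sketched as below for the two-state case (v=0 versus all v ≥ 1). This is a minimal illustration under stated assumptions, not the procedure used for Fig. 4: O2 is treated as a harmonic oscillator with an assumed spacing of 1556 cm^-1, and the rate constants in the usage example are hypothetical numbers.

```python
import math

HC_OVER_K = 1.438777     # hc/k in cm*K
O2_SPACING_CM = 1556.0   # assumed O2 vibrational spacing, not from the report


def excited_fraction(temperature_k, spacing_cm=O2_SPACING_CM):
    """Harmonic-oscillator fraction of molecules in v >= 1."""
    return math.exp(-HC_OVER_K * spacing_cm / temperature_k)


def thermal_rate(k_ground, k_excited, temperature_k):
    """k(T) = pop(0) * k_0 + pop(v>=1) * k_excited."""
    f = excited_fraction(temperature_k)
    return (1.0 - f) * k_ground + f * k_excited


def excited_state_rate(k_thermal, k_ground, temperature_k):
    """Invert the two-state sum for the v >= 1 rate constant."""
    f = excited_fraction(temperature_k)
    return (k_thermal - (1.0 - f) * k_ground) / f


if __name__ == "__main__":
    # Hypothetical values: k_0 from a drift tube, thermal value from a flow tube.
    k0, k_meas, temp = 2.0e-11, 4.0e-11, 1400.0
    print(excited_state_rate(k_meas, k0, temp))
```

Because the result is divided by the excited fraction, small differences between the two input rate constants are strongly amplified at low temperature, which is consistent with the artifact noted above at the lowest energies.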

Production of an excited electronic state of O2+, which is endothermic by 0.577 eV for
ground state reactants, is essentially thermoneutral for O2 (v=2). Production of this
state has been invoked to explain the kinetic energy dependence of this reaction at
energies above 1 eV [20], and it has been observed in the similar reaction of Ar+ with
O2 [18].

Therefore it is entirely feasible that the increase in the rate constant we observe with
increasing temperature is dominated by an increase in the O2 (v ≥ 2) population and
not O2 (v = 1). By assuming that the v = 1 and v = 0 rate constants are the same, we
derive rate constants for O2 (v ≥ 2). The results of this calculation are also shown in
Fig. 4. The roughness in the two curves for vibrationally excited O2 is due to the
small bump in the present data and gives an indication of the accuracy of the derived
rate constants. The derived rate for (v ≥ 2) is roughly independent of energy with a
rate constant of approximately $1.5 \times 10^{-10}$ cm$^3$ s$^{-1}$. This represents a very large
increase in the rate constant compared to v = 0, but the value is reasonable considering
that production of a new electronic state would be involved. The Franck-Condon factors
for production of the (v = 2) state are large, supporting this interpretation. Also
lending support to the (v ≥ 2) interpretation is the fact that an appreciable population of
v = 2 (1%) exists just about at the minimum in the temperature dependent data, while
at the same temperature the population of v = 1 is 14%. While the
(v ≥ 2) explanation is attractive for the reasons given above, it is not possible to rule
out either case, and in fact something in between the two cases is likely.


O+ + N2

Figure 5 shows the rate constants for the reaction of O+ with N2 as a function of
temperature. The rate constants are very small, reaction occurring on the order of 1 in
$10^3$ collisions. At low temperature the rate constants decrease with temperature and
show  a  minimum  in  the  1100-1300  K  range  before  rising  steeply  at  higher 
temperature. The results are in excellent agreement with the previous measurements
made at temperatures up to 900 K. Shown are the data from the NOAA laboratory [2].
The results of Chen et al. [3] are similar. Several drift tube studies were made of this
reaction [10, 21, 22]. Plotted in Fig. 5 are the data of Albritton et al. [21], the most
recent and thorough drift tube measurement. The other studies are similar, except that
the rate constants near the minimum in the Albritton et al. study are slightly smaller.
The drift tube data were taken as a function of ion kinetic energy and converted to
effective translational temperature by KE = 1.5 kT. The drift tube data agree with the
present data within experimental uncertainty up to 1300 K. Above 1300 K the present
thermal results increase much faster than the drift tube results, in which only the ion
translational energy is increased.

Comparing  drift  tube  results  to  pure  temperature  results  allows  information  on  internal 
energy  dependence  of  the  reactivity  to  be  derived.  Any  difference  in  rate  constant  at 
the same translational temperature is due to internal energy. For the reaction of O+
with N2 this means the N2 internal energy. The good agreement between the pure
temperature  and  kinetic  energy  data  at  low  temperatures  (<1000  K)  shows  that  N2 
rotational  energy  has  at  most  a  small  role  in  controlling  the  reactivity,  since  only  a 
small  fraction  of  N2  is  vibrationally  excited. 

The difference at the same translational temperature between the drift tube studies and
the present results at high temperature is due to N2 vibrations becoming excited in the
thermal experiments. N2 (v=1) is already populated to a small extent by 1000 K, yet
no clear enhancement in the rate constant is observed until 1300 K. Since there is only
a relatively small difference in the v=1 population between 1200 and 1300 K, the data
suggest that the increase may in fact come from a large rate enhancement for the v=2
state, which changes in population by almost a factor of 2 over this temperature range.
The speculation that the increase is due to v=2 is confirmed by the experiments of
Schmeltekopf et al. [3, 4] In their experiment, the reactants' translational and
rotational temperatures were fixed at 300 K while the N2 vibrational temperature was
varied from 300 to 6000 K by means of a microwave discharge. The measured rate
constants increased slightly from 1000-1200 K and dramatically above that. From
their data they derived vibrational state specific rate constants for v from 0-11. They
found the v=1 rate constant to be about the same as the v=0 rate constant. The rate
constant for v=2 increased by about a factor of 40 from the v=0 and 1 values. The rate
constant continued to increase for higher v's until a value close to the collisional value
was reached for v=6. This shows that the increase in our pure temperature data is due
almost exclusively to v=2, present in only small abundance.
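
How a small v=2 population can control the thermal rate may be illustrated with a state-weighted sum. The sketch below is a rough illustration under stated assumptions and is not the analysis used in this work: it assumes harmonic-oscillator populations, an N2 vibrational spacing of 2330 cm^-1 (a literature value, not from this report), and relative rate constants of 1, 1, and 40 for v=0, 1, and 2 as quoted above. Levels above v=2 are arbitrarily assigned the v=2 value, which understates their contribution.

```python
import math

HC_OVER_K = 1.438777     # hc/k in cm*K
N2_SPACING_CM = 2330.0   # assumed N2 vibrational spacing


def vib_population(v, temperature_k, spacing_cm=N2_SPACING_CM):
    """Harmonic-oscillator population of level v at a vibrational temperature."""
    x = math.exp(-HC_OVER_K * spacing_cm / temperature_k)
    return (1.0 - x) * x ** v


def relative_enhancement(temperature_k, ratios=(1.0, 1.0, 40.0), v_max=20):
    """Sum over v of pop(v) * k_v / k_0.

    ratios[v] is k_v / k_0; levels beyond the tuple reuse the last entry
    (an assumption made only to close the sum).
    """
    total = 0.0
    for v in range(v_max + 1):
        ratio = ratios[v] if v < len(ratios) else ratios[-1]
        total += ratio * vib_population(v, temperature_k)
    return total


if __name__ == "__main__":
    for t in (300.0, 1000.0, 1200.0, 1300.0, 1800.0):
        print(t, vib_population(2, t), relative_enhancement(t))
```

In this picture the enhancement stays near unity while only v=1 is populated and begins to grow once the v=2 population becomes non-negligible.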

The two nonthermal experiments can be combined to compare with the present data.
The NOAA vibrational temperature data were reported relative to the 300 K rate
constant. Scaling these data points to the drift tube rate constant at the same
translational temperature, rather than to the 300 K rate constant, allows a thermal
rate constant to be predicted with both vibrational and translational effects included.
This ignores the effects of rotational excitation (shown above to be at most a small
effect). It also assumes that the translational energy dependence of the vibrationally
excited species is the same as that for v=0. The results of this prediction are shown
in Fig. 5. Very good agreement is found. Unsatisfactory agreement is obtained (not
shown) if the vibrational temperature data are plotted relative to the 300 K rate
constant. This shows that both vibrational and translational energy are important in
controlling the rate constant in this temperature range and that the assumption of
similar translational dependence for different states is reasonable. The reason for
the strong increase at v=2 is at present unexplained.
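
The way the two nonthermal data sets are combined can be sketched as a product of the drift tube rate constant at a given translational temperature and the relative vibrational enhancement at the same vibrational temperature. The sketch below is one reading of the procedure described above, not a reproduction of it; the tables are hypothetical placeholders, not the NOAA data, and the helper names are illustrative.

```python
def interpolate(x, xs, ys):
    """Piecewise-linear interpolation on ascending xs, clamped at the ends."""
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    for i in range(1, len(xs)):
        if x <= xs[i]:
            frac = (x - xs[i - 1]) / (xs[i] - xs[i - 1])
            return ys[i - 1] + frac * (ys[i] - ys[i - 1])


def predicted_thermal_rate(temperature_k, drift, vib_relative):
    """Predicted thermal rate constant at temperature T.

    drift: (translational temperatures, rate constants) from a drift tube.
    vib_relative: (vibrational temperatures, k / k(300 K)) from a
    vibrational-temperature experiment.
    """
    k_translational = interpolate(temperature_k, *drift)
    enhancement = interpolate(temperature_k, *vib_relative)
    return k_translational * enhancement


# Hypothetical placeholder tables, NOT measured values.
DRIFT_TUBE = ([300.0, 1000.0, 1500.0, 2000.0], [1.2e-12, 6.0e-13, 6.0e-13, 8.0e-13])
VIB_RELATIVE = ([300.0, 1000.0, 1500.0, 2000.0], [1.0, 1.0, 2.0, 5.0])

if __name__ == "__main__":
    for t in (300.0, 1000.0, 1500.0, 1800.0):
        print(t, predicted_thermal_rate(t, DRIFT_TUBE, VIB_RELATIVE))
```

Multiplying the two factors is equivalent to assuming that every vibrational state shares the translational dependence of v=0, which is the assumption stated in the text.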

O+ + O2

Figure 6 shows the rate constants for the reaction of O+ with O2 as a function of
temperature. The rate constants are small, although they are about a factor of 10 larger
than those for the O+ with N2 reaction. At low temperature, the rate constants
decrease with temperature and show a minimum in the 800-1100 K range before
rising at higher temperature. Error bars of ±25% are shown. Several previous studies
of the temperature dependence of this reaction have been made. At NOAA, the
reaction was studied from 100 to 900 K. [2, 25, 26] Chen et al. [3] studied the reaction
from 300 to 700 K. The most recent NOAA results are shown in Fig. 6, the other
results being in good agreement with them. The present data agree with the previous
study within 15% at all temperatures except 900 K. At 900 K the present results are
27% higher than the NOAA results, still within the combined uncertainty.

In  addition  to  the  temperature  studies,  several  drift  tube  studies  have  been  made 
[10,  21,  22].  The  results  from  the  two  NOAA  studies  are  essentially  the  same,  and  the 
McFarland  et  al.  [10]  data  taken  in  a  helium  buffer  are  plotted  in  Fig.  6  as  well  as  the 
Johnsen  and  Biondi  [2]  data.  At  low  translational  temperatures,  the  rate  constants 
taken  in  the  drift  tube  studies  are  slightly  larger  than  those  found  for  the  pure 
temperature dependence at most effective temperatures. This is especially true for the
data of Johnsen and Biondi. However, the main difference between the data is that the
minimum in kinetic energy (outside the range of the data plotted in Fig. 6) is much
broader and occurs at higher translational temperatures (~1600 K) than the minimum
for  the  pure  temperature  data.  The  pure  temperature  rate  constants  also  increase  much 
faster  with  increasing  temperature  than  do  the  drift  tube  data.  These  differences  are 
due  to  the  O2  vibrations  becoming  excited  at  high  temperatures.  The  slight  offset 
between  the  pure  temperature  and  kinetic  energy  data  makes  a  direct  derivation  of  the 
internal  energy  dependence  of  the  reaction  slightly  complicated.  In  order  to  facilitate 
this  comparison  we  scaled  the  drift  tube  data  so  that  the  present  results  and  the  drift 
tube  data  coincide  at  the  lowest  translational  temperature  studied,  namely,  400  K. 
Figure  7  shows  this  comparison  including  only  the  present  results  and  the  data  from 
McFarland  et  al.  for  simplicity.  The  scaling  factor  for  the  drift  tube  data  is  0.8,  or  a 
20%  offset,  which  is  within  the  uncertainty  of  either  data  set. 

In  order  to  facilitate  the  derivation  of  vibrational  energy  effects,  the  two  data  sets  are 
fit  to  the  following  expression: 

$$k = A_1 (300/T)^n + A_2 \exp(-E_a/kT)$$

where $A_1$, $A_2$, $n$, and $E_a$ are adjustable constants. The results of this fitting are shown
in Fig. 7 as solid lines and, as can be seen, are very accurate representations of the data.
Below  the  minimum  in  the  pure  temperature  data,  there  is  excellent  agreement 
between  the  present  results  and  the  scaled  drift  tube  results.  This  shows  that  O2 
rotations  do  not  play  a  major  role  in  controlling  the  reactivity.  At  temperatures  above 
the  minimum,  the  pure  temperature  data  clearly  lie  above  the  drift  tube  data.  The 
separation  occurs  at  a  temperature  where  the  O2  vibration  starts  to  become  excited 
indicating  that  excited  O2  vibrations  increase  the  reactivity. 
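
The fitting expression, a power law that falls with temperature plus an Arrhenius term that rises with it, can be sketched as below. The parameter values are hypothetical and chosen only so that the curve shows a minimum; they are not the fitted constants from this work, and the grid search for the minimum is an illustrative convenience.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def rate_model(temperature_k, a1, n, a2, ea_ev):
    """k = A1 * (300/T)**n + A2 * exp(-Ea / (k_B T))."""
    power_law = a1 * (300.0 / temperature_k) ** n
    arrhenius = a2 * math.exp(-ea_ev / (K_B_EV * temperature_k))
    return power_law + arrhenius


def minimum_temperature(a1, n, a2, ea_ev, t_min=300, t_max=2000, step=10):
    """Temperature (K) on a coarse grid where the model rate is smallest."""
    temps = range(t_min, t_max + 1, step)
    return min(temps, key=lambda t: rate_model(t, a1, n, a2, ea_ev))


# Hypothetical parameters for illustration only (cm^3 s^-1 and eV).
PARAMS = dict(a1=2.0e-11, n=0.8, a2=1.0e-10, ea_ev=0.3)

if __name__ == "__main__":
    t_star = minimum_temperature(**PARAMS)
    print(t_star, rate_model(t_star, **PARAMS))
```

The competition between the two terms is what produces a minimum in the rate constant: the first term dominates at low temperature and the second at high temperature.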

From the two fits and the O2 vibrational population as a function of temperature, it is
possible to derive the rate constants for vibrationally excited O2. The rate constant at a
given temperature can be written as $k(T) = \sum_i k_i \, \mathrm{pop}(i)$, where k(T) is the measured
rate constant as a function of temperature, $k_i$ is the rate constant for a particular
vibrational state, and pop(i) is the population of that state. We take $k_0$ as the drift tube
data, assume that all vibrationally excited O2 reacts at the same rate, and derive the
fraction of O2 that is vibrationally excited from the known temperature. The results are
shown in Fig. 7 and represent the translational temperature dependence of the v>0 rate
constant. The data show an initial rise that probably arises because below
1200 K the drift tube data and the present results are similar, so that the
derivation depends on a small difference between two large numbers. At 1200 K and
above, the derived values continue to rise but at a much slower rate, roughly paralleling
the translational temperature dependence for v=0, i.e., the drift tube data, but a factor
of 4-5 larger.

On the basis of thermochemistry we know that the reaction with O2 (v''=0) can
produce O2+ in states from v'=0-7 and the reaction with O2 (v''=1) can produce O2+
in states from v'=0-8. Frequently, charge transfer is fastest when the process is energy
resonant. Therefore, in order to compare Franck-Condon factors it is useful to
compare the product states O2+ (v') for the O2 (v''=1) reaction to the O2+ (v'-1)
states for the O2 (v''=0) reaction. In this way the energy resonances are
approximately equal. As v' increases the energy gap becomes smaller, and finally for
v'=7 and 8 for O2 (v''=0) and O2 (v''=1), respectively, the energy gap reduces to a few
tens of wavenumbers. For these near resonant product states and the two next closest
resonant states, the Franck-Condon factor ratio is in the 4-5.6 region, values very close
to the experimental rate constant ratio. While this agreement may be fortuitous, it does
provide a simple explanation for the rate constant increase. If the above explanation
is true, the product states will be highly vibrationally excited, a result that can be
checked experimentally in a beam apparatus. The exact nature of the rate increase
awaits a detailed molecular dynamics calculation beyond the scope of this study.


O+ + NO

The charge transfer reaction

$$\mathrm{O^+(^4S) + NO(^2\Pi) \rightarrow NO^+(^1\Sigma) + O(^3P) + 4.3\ eV}$$

has been studied intensively using flowing afterglow [27], flow-drift tube [21, 28] and
static drift tube techniques [29]. The reaction has been found to be very slow at room
temperature ($k = 8 \times 10^{-13}$ cm$^3$ s$^{-1}$). Flow-drift tube data [21, 28] start at 0.3 eV
center of mass kinetic energy ($k = 3 \times 10^{-12}$ cm$^3$ s$^{-1}$) and increase very fast to a value of
$2 \times 10^{-10}$ cm$^3$ s$^{-1}$ at 3.5 eV center of mass kinetic energy. The gap between room
temperature and 0.3 eV center of mass kinetic energy has been covered by the static
drift tube experiment of Graham et al. [29]. Their results show that the rate constants do
not vary between room temperature and 1000 K and increase steeply above 1000 K.
These results are a little unusual since generally we do not find such a wide minimum.
Recently Le Garrec et al. [30] measured the reaction, using a CRESU machine, as a
function of temperature between 23 and 160 K. The rate constants decrease in this
range from a value of $3 \times 10^{-11}$ cm$^3$ s$^{-1}$ at 23 K to $3 \times 10^{-12}$ cm$^3$ s$^{-1}$ at 160 K.

We  have  used  the  HTFA  machine  [4]  to  measure  the  reaction  between  room 
temperature  and  1400K,  thus  covering  the  gap  between  the  flowing  afterglow  and 
flow-drift-tube  data. 

Figure 8 shows the rate constants for the reaction of O+ ions with NO as a function of
temperature. The rate constants increase from a value of $8 \times 10^{-13}$ cm$^3$ s$^{-1}$ at 300 K to
$2.6 \times 10^{-12}$ cm$^3$ s$^{-1}$ at 1400 K. In the same figure we also plot the NOAA data
measured in a flow-drift tube apparatus [21, 28], the Pittsburgh data from a static drift
tube [29], and the Rennes data from a CRESU machine [30]. Combining all these sets of
data we can establish the minimum in the rate constants to be between 300 and 400 K.
There is very good agreement between our room temperature data, the Pittsburgh data,
and the room temperature flowing afterglow point [27]. Above 500 K the Pittsburgh data
seem to be a little lower than the present and NOAA data, but the three sets are within
the combined experimental error.

Comparing the temperature and kinetic energy dependences of the rate constants can
reveal the role of vibrationally excited species in driving the reaction. For the O+ + NO
reaction we cannot come to any quantitative conclusions since the data are too
scattered. The pure temperature dependence data seem to be a little higher than the
kinetic energy data, thus hinting at an effect of vibrationally excited NO in driving
the reaction.

The reaction proceeds via two separate mechanisms. At low temperature a long-lived
complex is formed so that sufficient curve crossings occur to allow the complex
to exit in an exothermic channel, NO+($^1\Sigma$) + O($^1S$, $^1D$, $^3P$). At higher temperatures
(T > 300 K) the complex is not formed and the process is a direct one appearing
to have an endothermic threshold. This has been rationalized by Ferguson [31] to be a
consequence of the correlation of O+($^4S$) and NO($^2\Pi$) with the excited $A_1$ and $B_1$ states
of NO2+, which lie slightly above the reactants in energy [32]. The reaction does not
occur via a non-adiabatic electron jump due to the low Franck-Condon factor.

We fit the CRESU and the HTFA data sets to the following expression:

$$k = A_1 (300/T)^n + A_2 \exp(-E_a/kT)$$

where $A_1$, $A_2$, $n$ and $E_a$ are adjustable constants. The fit is shown in Fig. 9. It is not
a very good one because of some errors in the high temperature points from the CRESU
machine (NO was not purified in their experiment [33]).
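The two-term expression above can be fitted in several ways; the following is only a minimal sketch, not the fitting procedure used in this work. It assumes the activation energy is expressed in eV, exploits the fact that the model is linear in $A_1$ and $A_2$ once $n$ and $E_a$ are fixed, and scans a coarse grid in $n$ and $E_a$. The data, grid and "true" parameters are synthetic and purely illustrative.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def rate_model(temp_k, a1, a2, n, ea_ev):
    """k(T) = A1*(300/T)**n + A2*exp(-Ea/(kB*T)), with Ea in eV."""
    return a1 * (300.0 / temp_k) ** n + a2 * math.exp(-ea_ev / (K_B_EV * temp_k))


def fit_amplitudes(temps, rates, n, ea_ev):
    """Least-squares A1 and A2 for fixed n and Ea (2x2 normal equations)."""
    f = [(300.0 / t) ** n for t in temps]
    g = [math.exp(-ea_ev / (K_B_EV * t)) for t in temps]
    sff = sum(x * x for x in f)
    sgg = sum(x * x for x in g)
    sfg = sum(x * y for x, y in zip(f, g))
    sfk = sum(x * k for x, k in zip(f, rates))
    sgk = sum(x * k for x, k in zip(g, rates))
    det = sff * sgg - sfg * sfg
    a1 = (sfk * sgg - sgk * sfg) / det
    a2 = (sgk * sff - sfk * sfg) / det
    return a1, a2


def fit_rate_model(temps, rates, n_grid, ea_grid):
    """Grid search over (n, Ea); A1 and A2 are solved for at each grid point."""
    best = None
    for n in n_grid:
        for ea_ev in ea_grid:
            a1, a2 = fit_amplitudes(temps, rates, n, ea_ev)
            sse = sum((rate_model(t, a1, a2, n, ea_ev) - k) ** 2
                      for t, k in zip(temps, rates))
            if best is None or sse < best[0]:
                best = (sse, a1, a2, n, ea_ev)
    return best[1:]


# Synthetic, illustrative data only (not the measured rate constants).
TEMPS = [100.0 * i for i in range(1, 19)]        # 100 K ... 1800 K
TRUE = (1.0e-12, 5.0e-12, 0.5, 0.10)             # hypothetical A1, A2, n, Ea (eV)
RATES = [rate_model(t, *TRUE) for t in TEMPS]
FIT = fit_rate_model(TEMPS, RATES,
                     n_grid=[0.1 * i for i in range(11)],
                     ea_grid=[0.02 * i for i in range(1, 11)])
```

With real, scattered data the grid would normally be refined around the best point, or replaced by a general nonlinear least-squares routine.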

These results support the mechanism suggested above, since the activation energy is,
within the experimental error, similar to the endothermicity (about 0.1 eV) of the process
leading from $\mathrm{O^+({}^4S)}$ + $\mathrm{NO({}^2\Pi)}$ to the $A_1$ and $B_1$ states of $\mathrm{NO_2^+}$.

In addition to the reactions described above, we have measured three more reactions, for
which the data have not yet been analyzed.

$\mathrm{Ar^+}$ + SO2

The results for this reaction are shown in Fig. 10. Up to about 900 K the rate constants
decrease with increasing temperature. In this temperature range there is no difference
between the flow-drift tube [34] and the HTFA data. Above 900 K and up to 1400 K the
HTFA rate constants increase by about a factor of 2, whereas the flow-drift tube rate
constants continue to decrease and start increasing only above 4000 K.


$\mathrm{Ar^+}$ + CO2

The results for this reaction are shown in Fig. 11. Here again there is very good
agreement between the HTFA and flow-drift tube [35] data up to 600 K. Above that
temperature the HTFA rate constants decrease much faster than the flow-drift tube
data.

$\mathrm{CO_2^+}$ + O2

The results for this reaction are shown in Fig. 12. Here we compare the HTFA data
with flow-drift tube data taken at three temperatures [36]. We find behavior similar
to that of the reaction $\mathrm{Ar^+}$ + SO2, i.e., at low temperatures (up to about 600 K) there is
no significant difference between the four sets of data; above 600 K the HTFA rate
constants increase by about a factor of 6 up to 1600 K, whereas the drift tube rate
constants start increasing at a higher temperature.

The results for the last three reactions will be analyzed in the near future, and we shall
try to determine the role of internal excitation in driving the reactions.


CONCLUSIONS 

The  rate  constants  for  a  few  ion-molecule  reactions  have  been  measured  for  the  first 
time  at  temperatures  up  to  1800K.  The  results  clearly  show  that  drift-tube  studies  do 
not  model  kinetics  at  high  temperatures  adequately.  The  problem  arises  mainly  from 
the  fact  that  vibrations  are  not  excited  in  drift-tube  measurements.  However, 
combining  the  present  high  temperature  data  with  previous  drift  tube  results  allows 
rate constants for vibrationally excited molecules to be derived.

FIGURE CAPTIONS


Fig.  1  -  High  Temperature  Flowing  Afterglow  Apparatus 
Fig.  2  -  Rate  constants  for  the  reaction  of  with  O2  . 

The NOAA kinetic energy data are from McFarland et al. (10) and the temperature
data are from Lindinger et al. (2). kt is the Langevin collisional rate constant.
“Maximum  for  v>0”  assumes  that  the  v=0  rate  constants  are  given  by  “NOAA  (KE)” 
and  that  all  vibrationally  excited  O2  reacts  at  the  Langevin  rate. 

Fig. 3 - Rate constants for the reaction of $\mathrm{N_2^+}$ with O2. The NOAA (KE) kinetic
energy data are from McFarland et al. (10) and the NOAA (T) temperature data are
from Lindinger et al. (2).

Fig. 4 - Rate constants for the reaction of $\mathrm{N_2^+}$ with O2 as a function of rotational and
vibrational energy. The curves marked v > 0 and v > 1 are derived rate constants for
vibrationally excited O2 as described in the text.

Fig. 5 - Rate constants for the reaction of $\mathrm{O^+}$ with N2.

The NOAA temperature data are from Lindinger et al. (2) and the drift tube data from
Albritton et al. (21). The predicted values are described in the text.

Fig. 6 - Rate constants for the reaction of $\mathrm{O^+}$ with O2. The NOAA temperature data
are taken from Lindinger et al. (2) and Ferguson (26). The NOAA kinetic energy data
are from McFarland et al. (10) and J&B refers to the drift tube work of Johnsen and
Biondi (22).

Fig. 7 - Rate constants for the reaction of $\mathrm{O^+}$ with O2. The NOAA kinetic energy data
are from McFarland et al. (10). The solid lines through the data represent fits to the
expression $k = A_1 (300/T)^n + A_2 \exp(-E_a/kT)$. The derivation of the rate constants
for O2 (v>0) is described in the text.


Fig. 8 - Rate constants for the reaction of $\mathrm{O^+}$ ions with NO. Rennes data are from Le
Garrec et al. (30), NOAA data from Albritton et al. (21) and Pittsburgh data from
Graham et al. (29).

Fig. 9 - Rate constants for the reaction of $\mathrm{O^+}$ ions with NO. Rennes data are from Le
Garrec et al. (30). The solid line through the data represents a fit to the expression
$k = A_1 (300/T)^n + A_2 \exp(-E_a/kT)$.

Fig. 10 - Rate constants for the reaction of $\mathrm{Ar^+}$ ions with SO2. NOAA data are from
Dotan et al. (34).

Fig. 11 - Rate constants for the reaction of $\mathrm{Ar^+}$ ions with CO2. NOAA data are from
Dotan and Lindinger (35).

Fig. 12 - Rate constants for the reaction of $\mathrm{CO_2^+}$ ions with O2. Drift data are from
Ferguson et al. (36).

ELECTROMAGNETIC MODELING OF COMPLEX DIELECTRIC/METALLIC MINES
IN A LAYERED EARTH


George  W.  Hanson 
Associate  Professor 

Department  of  Electrical  Engineering  and  Computer  Science 
University  of  Wisconsin-Milwaukee 


Abstract 

An  accurate  electromagnetic  model  of  a  buried  dielectric/metallic  mine  has  been  developed 
as  an  aid  to  the  development  of  target  detection  and  identification  technologies  for  military  and 
humanitarian weapons cleanup efforts. An integral equation model for the buried mine
configuration  has  been  formulated  and  numerically  solved  exploiting  an  efficient  numerical 
technique  to  compute  important  electromagnetic  features.  The  implemented  method  is  applicable 
to  the  determination  of  a  mine’s  natural  resonant  frequencies  and  eigencurrents/fields,  which  can 
be  used  for  detection  and  identification  schemes.  The  influence  of  the  layered  earth  background 
environment,  into  which  the  target  (mine)  is  embedded,  on  natural  modal  characteristics  has  been 
ascertained.  Further  development  of  this  technique  (ongoing  by  current  graduate  students)  will 
yield  results  of  significant  value  in  guiding  the  design  of  future  detection  and  identification 
technologies for the buried mine problem.

I. Problem Significance


The  detection  of  potentially  dangerous  hidden  objects  is  of  great  importance.  Even 
specifically  considering  the  mine  problem,  there  are  many  different  scenarios  to  consider.  For 
instance,  the  detection  of  unexploded  ordnance  (UXO)  is  a  very  difficult  problem  for  military 
range  cleanup.  According  to  the  Department  of  Defense,  there  are  approximately  nine  million 
acres  of  past  and  present  test  ranges  that  contain  unexploded  ordnance  [1],  including  Air  Force 
property.  This  problem  is  of  increasing  concern  to  civilians  due  to  the  Base  Realignment  and 
Closure  (BRAC)  Program,  under  which  some  military  bases  have  been  closed  and  the  land 
returned to the public or private sector. It is estimated that 1,200-1,700 Formerly Used Defense
Sites  (FUDS)  are  contaminated  [1].  The  possibility  of  UXOs  on  land  returned  to  private  or 
public  use  is  certainly  disturbing.  This  is  not  to  imply  that  the  military  purposely  releases  land 
with  UXO  contamination,  but  the  extent  of  the  regions  affected,  together  with  the  difficulty  of 
absolutely  assuring  removal  of  all  UXO  material  (due  to  imperfect  detection  technology)  renders 
the  release  of  contaminated  land  likely. 

Aside  from  the  possibility  of  UXO  on  FUDS  within  the  United  States,  the  detection  of  land 
mines  in  foreign  countries  is  a  significant  problem  for  humanitarian  cleanup  efforts.  There  are 
now  over  50  million  abandoned,  yet  active,  land  mines  in  over  60  countries  around  the  world 
[2].  These  mines  kill  some  10,000  civilians  each  year,  and  injure  many  more.  Unfortunately, 
in  many  countries  mines  continue  to  be  set  at  an  astounding  rate.  Current  technology  for  mine 
detection  is  unsatisfactory,  being  imprecise,  dangerous,  and  expensive. 

Separate from the buried mine issue, the identification of hidden objects is of great concern
for safety in airports, courtrooms, prisons, etc. Current detectors are only useful for
determination  of  the  presence  of  sufficient  quantities  of  metallic  material.  Since  non-metallic 
materials  are  being  incorporated  into  weapons  to  avoid  detection,  technologies  to  detect  and 
identify  such  materials  need  to  be  improved. 

The  above  described  problems  represent  an  important  subset  of  the  wide  range  of  scenarios 
which  require  detection  and  identification  technology  improvement.  Due  to  the  considerable 
variation  of  object  shapes  and  material  properties  which  must  be  detected  and  identified,  and  the 
range of background environments into which the objects may be embedded, no single
technology will prove to be completely satisfactory. In a general situation, it may be necessary
to  employ  a  suite  of  technologies,  based  on  electromagnetic,  acoustic,  chemical,  and  nuclear 
mechanisms,  to  achieve  accurate  detection  and  identification.  Even  within  the  electromagnetic 
domain,  for  example,  a  range  of  approaches  may  be  required.  A  combination  of  low  frequency 
(object  small  compared  to  wavelength),  resonance  (object  comparable  to  wavelength),  and  high 
frequency  (object  large  compared  to  wavelength)  frequency-domain  methods  may  be  useful,  in 
addition  to  wideband  and  transient  methods.  Within  the  above,  detection  and  identification  can 
be  based  upon  singularity  characteristics  (magnetic  or  electromagnetic)  or  actual  electromagnetic 
imaging  (optical,  microwave,  etc.),  although  the  latter  is  generally  a  more  difficult  problem. 

II.  Project  Description 

The  work  performed  here  concerns  the  electromagnetic  modeling  of  primarily  dielectric 
bodies  embedded  in  an  inhomogeneous  background  environment.  The  specific  problem  of  buried 
dielectric  mines  is  addressed  here,  although  with  slight  modification  other  objects/environments 
can  be  considered.  The  type  of  mine  considered  is  primarily  dielectric,  with  small  amounts  of 
metallic  material  incorporated  to  model  springs  or  wires  which  are  found  on  some  mines.  The 
proposed  method  of  analysis  is  fullwave,  and  can  be  applied  to  a  range  of  frequencies  for  the 
determination  of  singularities  and  scattering  response,  which  can  be  used  for  detection  and 
identification  schemes  [3], [4].  The  principal  idea  is  to  determine  the  electromagnetic 
characteristics  of  interest,  such  as  natural  frequencies,  and  the  influence  of  the  background 
environment  on  those  quantities.  Given  a  specific  shape  of  interest,  the  influence  of  the 
background  material  layering  and  constitutive  parameters  on  the  resonance  characteristics  can 
be  determined.  The  determination  of  natural  frequencies  allows  one  to  determine  the  correct 
frequency  range  which  will  provide  a  strong  response  from  the  object,  and  the  natural  modes 
(natural  currents  or  fields)  provide  information  as  to  what  type  of  waveform  polarization  is 
necessary  to  strongly  excite  the  body  [5].  The  influence  of  the  background  environment  on  the 
object’s  natural  characteristics  is  important  to  consider,  since  the  object  will  rarely  be  in  free- 
space,  physically.  It  may  happen,  though,  that  the  object  will  behave  electrically  as  if  it  were 
in free space (or a homogeneous space) in some situations. In other situations, the background
environment will need to be accounted for explicitly.


III.  Brief  Overview  of  Previous  Work 

There  has  been  a  considerable  amount  of  interest  in  the  development  of  appropriate 
techniques  for  detecting  and  identifying  buried  objects  or  bodies.  Geophysical  sensing 
technologies  have  been  developed  for  the  detection  and  identification  of  various  types  of 
geological  bodies  or  features,  such  as  underground  oil  reserves,  mineral  deposits,  and  other 
structures  of  interest.  The  detection  and  identification  problem  for  the  buried  mine,  either 
conducting  or  nonconducting,  has  received  less  attention.  Current  metallic  mine  detectors  use 
standard  "metal  detector"  technology,  which  is  inaccurate  in  the  detection  phase  and  provides 
no  identification  capability.  Dielectric  mines  provide  a  more  difficult  problem,  and  no  system 
has  been  developed  which  operates  in  a  satisfactory  manner.  A  brief  compilation  of  research 
papers is provided [6]-[11], which detail some recent work in this direction. It should be noted
that [10], [11] are particularly relevant to the problem and solution method proposed here. This
work represents a significant extension of previous efforts in several ways. The model
developed is an exact, three-dimensional description of the problem, including an arbitrary
multi-layered earth with a general dielectric body. Most, but not all, previous efforts considered a
two-dimensional problem. All previous work utilized a simpler half-space model of the earth/air
interface,  and  neglected  other  soil  layers  which  often  exist.  The  mine  includes  metallic  material 
to  model  wires,  whereas  previous  work  has  considered  either  metallic  objects  or  dielectric 
objects,  but  no  mixed  conducting/dielectric  bodies.  The  project  particularly  concentrates  on 
natural  modal  properties,  which  are  valuable  in  the  identification  problem,  but  have  not  been 
characterized for the buried mine.

IV. Theoretical Formulation of Electromagnetic Model
A. Formulation of Integral Equation Model

In  this  section  a  brief  description  of  the  theoretical  formulation  for  the  considered  problem 
is  provided.  In  the  following  development,  all  electromagnetic  quantities  will  be  assumed  to 
obey  Maxwell’s  equations  in  the  two-sided  Laplace  transform  domain,  governed  by  the  transform 
pair 


where $s = \Omega + j\omega$, and $\Omega$, $\omega$ are real quantities.
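For reference, the standard two-sided Laplace transform pair is sketched below. The display in the source is not legible, so the symbols $f(t)$ and $F(s)$ are assumed names; only the definition of $s$ is taken from the text.

```latex
F(s) = \int_{-\infty}^{\infty} f(t)\, e^{-st}\, dt ,
\qquad
f(t) = \frac{1}{2\pi j} \int_{\Omega - j\infty}^{\Omega + j\infty} F(s)\, e^{st}\, ds
```

The inversion contour runs parallel to the imaginary axis at $\mathrm{Re}(s) = \Omega$, inside the strip of convergence of $F(s)$.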

Consider the geometry shown in Fig. 1, which depicts a closed surface S bounding volume
V containing a generally inhomogeneous and anisotropic medium characterized by $\bar{\epsilon}(\vec r)$, $\mu_0$. The
medium external to S is assumed for simplicity to be isotropic but generally inhomogeneous.
The layered exterior medium case corresponds to the buried dielectric body problem assuming
the ground can be approximated as consisting of planar layers. In each layer, as well as internal
to S, material loss can be accommodated by defining the permittivity to be complex as $\epsilon \to \epsilon + \sigma/s$
for real conductivity $\sigma$.

A  wave  impedance  and  propagation  constant  can  be  defined  for  the  j-th  exterior  region. 


and formally extended to the interior region

Fig. 1. General dielectric/conducting body in multilayered media.

noting the following combinations

$$\gamma_j Z_j = \gamma Z = s\mu_0, \qquad \gamma_j Z_j^{-1} = s\epsilon_j, \qquad \gamma Z^{-1} = s\bar{\epsilon} \qquad (4)$$

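The displays defining the wave impedance and propagation constant are not legible in the source. As a hedged sketch, the standard Laplace-domain definitions for the j-th exterior layer, which are consistent with the products and ratios in (4), would read as follows (the subscripted symbols are assumed notation):

```latex
Z_j = \sqrt{\frac{\mu_0}{\epsilon_j}} , \qquad
\gamma_j = s \sqrt{\mu_0 \epsilon_j}
\quad\Longrightarrow\quad
\gamma_j Z_j = s \mu_0 , \qquad \gamma_j Z_j^{-1} = s \epsilon_j
```

With the complex permittivity $\epsilon_j + \sigma_j / s$ substituted for $\epsilon_j$, the same expressions would cover lossy layers.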

The  relationship  between  the  electric  field  and  an  electric  current  source  can  be  written  as 

^(nYo)=-(^ViKY,)  ;^(o) 

where < , > denotes integration of the two terms separated by the comma over common spatial
coordinates, and the symbol above the comma indicates the real scalar product. The propagation
constant $\gamma_0$ is merely symbolic, and represents dependence on all of the exterior region layers,
i.e., $\gamma_0 = \{\gamma_1, \gamma_2, \gamma_3, \ldots\}$. The electric dyadic Green's function provides the $\alpha$-th
component of the field at $\vec r$ due to the $\beta$-th component of a Hertzian current element at $\vec r'$. In
general, the Green's dyadic can be written as

where the first term is evaluated in a principal value sense for a specified exclusion volume.
The second term is the depolarizing dyad contribution, which depends on the shape of the
exclusion volume as well as the material properties of the region which includes the source
current (with the appropriate value of propagation constant assigned to $\gamma$). Details concerning
(6), as well as general forms of the principal value term for homogeneous and layered media
surrounding regions, are provided in Appendix B of [12].

Next, a set of coupled integral equations which model the dielectric/metallic body completely
contained within one layer of the surrounding environment can be formulated [13], [14], [15]. As
a  specific  example,  consider  the  geometry  depicted  in  Fig.  2.  A  dielectric  disk  resembling  a 
dielectric  mine  is  shown,  along  with  some  thin  metallic  pieces  that  represent  wires  or  springs. 
The IEs for this geometry are formed by enforcing boundary conditions along the surface of the
wires and field conditions in the dielectric region.

Fig. 2. Example of a buried dielectric mine with arbitrary conducting wires.

The boundary and field conditions are given by

$$\hat{a} \cdot \left[ \sum_{n=1}^{N} \vec{e}^{\,s}_{n}(\vec r) + \vec{e}^{\,s}_{d}(\vec r) \right] = -\hat{a} \cdot \vec{e}^{\,i}(\vec r) \qquad \ldots \forall\, \vec r \in \Gamma_m,\ m = 1..N$$

$$\vec{e}(\vec r) - \sum_{n=1}^{N} \vec{e}^{\,s}_{n}(\vec r) - \vec{e}^{\,s}_{d}(\vec r) = \vec{e}^{\,i}(\vec r) \qquad \ldots \forall\, \vec r \in V \qquad (7)$$

where $\Gamma_m$ extends over the m'th wire, V denotes the space occupied by the dielectric, $\hat{a}$ is a unit
vector tangential to the wire, $\vec{e}^{\,s}_{n}$ is the scattered electric field maintained by conduction current
on the n'th wire, $\vec{e}^{\,s}_{d}$ is the scattered electric field due to equivalent polarization current in the
inhomogeneous dielectric region, $\vec{e}$ is the total electric field in the dielectric region, and $\vec{e}^{\,i}$ is
the incident electric field maintained by remote sources with the wires and dielectric region
absent, but in the presence of the multilayered background environment. In the above, the first
equation represents 2N scalar equations which force the total tangential electric field to be zero
along the m'th perfectly conducting wire. The next equation represents three scalar equations
which force the total electric field at every point within the dielectric region to equal the incident
field plus the field maintained by the various currents. The radiation condition is enforced,
along with boundary conditions at the other planar interfaces, by the Green's function for the
background environment.

The  volume  equivalence  principle  is  used  to  account  for  the  inhomogeneous  dielectric  region 
embedded  in  the  background  layering  [16], [14].  The  dielectric  region  is  replaced  by  a 
homogeneous  region  containing  equivalent  polarization  currents  which  reside  in  the  space 
previously  occupied  by  the  dielectric  and  vanish  outside  of  that  region,  as  depicted  in  [15].  The 
equivalent currents are

$$\vec{J}_{eq}(x,z) = j\omega\,[\epsilon(x,z) - \epsilon_j]\,\vec{e}(x,z) \qquad (8)$$

A coupled set of IEs is formed by substituting the relations (5) and (8) into the set of
boundary and field conditions (7), resulting in
(9)


\n=l  I  V. 


m  =  l..N 


(10) 


where 

Z'’=Z‘*[€-e/]  (11) 

The coupled set of IEs (9) and (10) can be solved using the method of moments (MoM).
The conduction currents ($J$) on each wire can be expanded in overlapping piecewise sinusoidal
(PWS) functions. The electric field in the dielectric region can be expanded in entire-domain
basis functions. A Galerkin solution is implemented by testing each scalar IE with the
appropriate function, leading to a matrix system. To determine natural frequencies, the
impressed field term is set to zero and a numerical root search is performed to find the complex
value of $\omega$ which forces the determinant of the impedance matrix to vanish.
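The root search described above can be illustrated with a small sketch. The 2x2 matrix below is a hypothetical stand-in (two coupled series RLC loops, written in the Laplace variable s) rather than the MoM impedance matrix of this work, and the complex secant iteration is just one common choice of root finder, not necessarily the one used here.

```python
import math


def toy_impedance_matrix(s, mutual=0.1):
    """Hypothetical 2x2 'impedance matrix': two series RLC loops with mutual coupling."""
    z11 = s * 1.0 + 0.2 + 1.0 / (s * 1.0)      # L = 1, R = 0.2, C = 1
    z22 = s * 1.0 + 0.4 + 1.0 / (s * 0.25)     # L = 1, R = 0.4, C = 0.25
    z12 = s * mutual
    return [[z11, z12], [z12, z22]]


def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def complex_secant(func, s0, s1, tol=1e-12, max_iter=60):
    """Secant iteration in the complex plane for func(s) = 0."""
    f0, f1 = func(s0), func(s1)
    for _ in range(max_iter):
        if f1 == f0:
            break
        s2 = s1 - f1 * (s1 - s0) / (f1 - f0)
        s0, f0 = s1, f1
        s1, f1 = s2, func(s2)
        if abs(s1 - s0) <= tol * max(1.0, abs(s1)):
            break
    return s1


# Two starting guesses near the expected resonance of the first loop.
GUESS_A = complex(-0.05, 0.95)
GUESS_B = complex(-0.12, 1.02)

# Uncoupled case: the natural frequency is known analytically.
ROOT_UNCOUPLED = complex_secant(
    lambda s: det2(toy_impedance_matrix(s, mutual=0.0)), GUESS_A, GUESS_B)
EXPECTED_UNCOUPLED = complex(-0.1, math.sqrt(0.99))

# Coupled case: the root shifts slightly; it is found numerically.
ROOT_COUPLED = complex_secant(
    lambda s: det2(toy_impedance_matrix(s, mutual=0.1)), GUESS_A, GUESS_B)
```

The real part of each root gives the damping rate and the imaginary part the oscillation frequency of the natural mode. For a full MoM matrix the determinant would be evaluated numerically at each trial frequency in the same way.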


B.  Special  Computational  Considerations 

Entire-domain basis functions are used to render the computational solution of (9) and (10)
tractable. If the basis functions are carefully chosen, based upon mathematical and physical
considerations, the numerical solution of the integral equation system can be made much more
efficient than if the standard subdomain basis functions are used [17]. Entire-domain basis
functions have been used in the solution of conducting wire and patch models, but generally have
not been applied to the dielectric body problem. We have found in this work that the cavity-type
entire-domain basis functions utilized here result in a small matrix size, which is computationally
efficient. The entire-domain scheme is also free of fictitious excess charge within the problem
domain, which can result from the utilization of subdomain basis functions. Considering that
many mine shapes are well modeled by simple geometrical shapes that are amenable to the
entire-domain approach, it is not surprising that this technique works very well for the buried
mine problem considered here.

V.  Results 

The integral equation (7) for a dielectric region embedded in multilayered media was
numerically solved using a set of entire-domain cavity modes. The cavity modes should be valid
for expanding the electric fields for the cases $\epsilon/\epsilon_j > 1$ (PMC wall approximation) and $\epsilon/\epsilon_j < 1$ (PEC
wall approximation). Results for rectangular regions and cylindrical regions have been obtained
for each case, and results are being prepared for journal publication. Several examples are
included here to illustrate the accuracy and efficiency of the method. Fig. 3 shows a
convergence plot for the complex resonant frequency of a rectangular dielectric region (39.5 x
23.0 x 13.5 cm). The dielectric inside the region has $\epsilon = 2.5\epsilon_0$, whereas the surrounding region
has $\epsilon = 12.0\epsilon_0$. PEC modes were used for field expansion in a Galerkin solution. It can be seen
that the resonant frequency converges very quickly as the number of basis functions
increases, resulting in efficient computation. Fig. 4 shows similar results for a PMC case (7.45
x 7.45 x 2.98 cm), with the inner region characterized by $\epsilon = 79.46\epsilon_0$, whereas the surrounding
region has $\epsilon = \epsilon_0$. Fig. 5 shows results for a cylindrical PMC resonator (radius = 5.25 mm,
height = 4.6 mm, $\epsilon = 38\epsilon_0$) residing in air. Fig. 6 provides an indication of the accuracy of the
method. A cylindrical resonator (radius = 4.55 mm, height = 4.04 mm, $\epsilon = 37.1\epsilon_0$) is positioned
on a grounded dielectric layer of thickness h having $\epsilon = 222\epsilon_0$. Results are compared with
another author's theoretical results (cross) and experimental results (triangle) [18]. Our method
is seen to be in good agreement with measurements.
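As a rough point of reference for the PEC-wall case, the textbook resonant frequencies of a closed, lossless rectangular cavity with perfectly conducting walls can be evaluated directly. This is only an idealized sketch: it ignores the surrounding medium and radiation loss, so it yields a real frequency rather than the complex resonant frequency obtained from the integral equation. The function name and the choice of mode are illustrative assumptions.

```python
import math

C0 = 299_792_458.0  # speed of light in vacuum, m/s


def pec_cavity_frequency(dims_m, eps_r, mode):
    """Resonant frequency (Hz) of mode (m, n, p) in a closed PEC box filled with eps_r."""
    a, b, d = dims_m
    m, n, p = mode
    if sum(1 for index in mode if index == 0) > 1:
        raise ValueError("at most one mode index may be zero")
    root = math.sqrt((m / a) ** 2 + (n / b) ** 2 + (p / d) ** 2)
    return C0 / (2.0 * math.sqrt(eps_r)) * root


# Rectangular region quoted in the text: 39.5 x 23.0 x 13.5 cm, relative permittivity 2.5.
BOX = (0.395, 0.230, 0.135)
F_110 = pec_cavity_frequency(BOX, 2.5, (1, 1, 0))
```

For this box the lowest such mode falls in the range of a few hundred MHz, which indicates the frequency band where the search for natural resonances would start.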
Fig. 3. Rectangular DR (PEC).

Fig. 4. Rectangular DR (PMC).

Fig. 5. Resonant frequency and Q factor of isolated DR versus number of basis functions.

Fig. 6. Resonant frequency and Q factor of DR on grounded slab.
Abstract

In this paper we present a model for the computation of the structure function parameters [1] and
the local Richardson number [2] for stratospheric flow. Owing to the nature of the available aircraft data, the
model applies the Boussinesq-Kolmogorov turbulence modeling hypothesis to recover the (vertical) gradients
of the flow. We test the predictions of the model using the data collected by NASA during the 1989
AASE mission [3].




1  Introduction 

For many practical applications it is useful to know the refractive index structure function [1, 4]

$$\overline{[n(\mathbf{x}, t) - n(\mathbf{x} + \mathbf{r}, t)]^2}$$

where $n(\mathbf{x})$ is the fluctuating refractive index. In the inertial range this structure function depends only
on $r$ and has the form $C_n^2 r^{2/3}$ ($C_n^2$ is referred to as the 'refractive structure function parameter'). In the
absence of humidity (e.g. in the high stratosphere) the fluctuations in $n(\mathbf{x})$ are due to the fluctuations in
the potential temperature $\theta$ and $C_n^2 \sim C_T^2$, where $C_T^2$ is the structure function parameter for the potential
temperature [1]. In the inertial range $C_T^2$ is related to the temperature spectrum by

$$\Phi_T(k) = 0.25\, C_T^2\, k^{-5/3} = \beta\, \chi_\theta\, \epsilon^{-1/3} k^{-5/3} \qquad (1.1)$$

where $k$ is the wave number and $\epsilon$, $\chi_\theta$ are the molecular destruction rates of the turbulent kinetic energy and
potential temperature, respectively.

Using the results obtained in [1] for the planetary boundary layer, $\beta \approx 0.4$ and hence

$$C_T^2 = 1.6\, \chi_\theta\, \epsilon^{-1/3} \qquad (1.2)$$

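Following the model presented in [1], $\chi_\theta$ and $\epsilon$ are related to the flow gradients and cospectra by the following
relations

$$\chi_\theta = -2\, \overline{\theta w}\, \frac{\partial \Theta}{\partial z} \qquad (1.3)$$

$$\epsilon = -\overline{uw}\, \frac{\partial U}{\partial z} - \overline{vw}\, \frac{\partial V}{\partial z} + \frac{g}{T_0}\, \overline{\theta w}$$

(Upper case variables $U$, $V$, $W$ represent components of the large scale flow and $\Theta$ the mean potential
temperature. Lower case variables represent the fluctuating turbulent residuals of these quantities.)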
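The chain from fluxes and vertical gradients to the structure function parameter can be sketched as follows. The function and argument names are assumptions made for illustration, and the numerical inputs are hypothetical values chosen only to exercise the formulas, not measured data.

```python
def chi_theta(theta_w, dtheta_dz):
    """Destruction rate of potential temperature fluctuations, eq. (1.3)."""
    return -2.0 * theta_w * dtheta_dz


def dissipation_rate(uw, vw, theta_w, du_dz, dv_dz, t0, g=9.81):
    """Shear production plus buoyancy term, as in the expression for epsilon above."""
    return -uw * du_dz - vw * dv_dz + (g / t0) * theta_w


def ct2(chi, eps):
    """Temperature structure function parameter, eq. (1.2)."""
    if eps <= 0.0:
        raise ValueError("epsilon must be positive")
    return 1.6 * chi * eps ** (-1.0 / 3.0)


# Hypothetical local averages (SI units) for a stably stratified layer.
CHI = chi_theta(theta_w=-0.001, dtheta_dz=0.01)
EPS = dissipation_rate(uw=-0.01, vw=0.0, theta_w=-0.001,
                       du_dz=0.01, dv_dz=0.0, t0=220.0)
CT2 = ct2(CHI, EPS)
```

The guard on `eps` reflects that a non-positive dissipation estimate, which can occur with noisy local averages, makes the power law in (1.2) meaningless.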

We observe that the average value of $C_T^2$ can be obtained from spectral analysis (using eq. (1.1)). However,
the 'local' values of $C_T^2$ (which depend on local averages as expounded in [1]) require the computation of
$\chi_\theta$, $\epsilon$ and therefore the evaluation of the vertical gradients.

Similarly the Richardson number Ri, which measures the stability of the geophysical flow against
temperature stratification [2], is defined as


and its computation also requires the vertical flow gradients.
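The defining display is not legible in the source. As a hedged sketch, the usual gradient Richardson number, the ratio of the buoyancy term to the squared vertical shear, can be evaluated as below; the exact form used in [2], in particular the choice of reference temperature, is an assumption here.

```python
def gradient_richardson(dtheta_dz, du_dz, dv_dz, theta0, g=9.81):
    """Ri = (g/theta0) * dTheta/dz / ((dU/dz)**2 + (dV/dz)**2)  (standard form, assumed)."""
    shear_sq = du_dz ** 2 + dv_dz ** 2
    if shear_sq == 0.0:
        raise ValueError("vertical shear is zero; Ri is undefined")
    return (g / theta0) * dtheta_dz / shear_sq


# Hypothetical stratospheric values (SI units), for illustration only.
RI = gradient_richardson(dtheta_dz=0.01, du_dz=0.01, dv_dz=0.0, theta0=400.0)
```

Both vertical shear components enter the denominator, which is why the estimate of the vertical gradient of V is needed in the last step of the computation.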

In the past, aircraft data collection missions to the stratosphere [3, 5, 6] used one meteorological probe.
Accordingly they provide only the flow gradients along the horizontal flight path. Our objective in this
paper is to present a model which surmounts this difficulty and thus allows us to calculate the local values
of $C_T^2$ and Ri in the stratosphere. To this end the model utilizes the Boussinesq-Kolmogorov hypothesis [7]
coupled with the Smagorinsky model for the turbulent eddy viscosity [8].

To test this model against actual data we use the AASE 5 Hertz data which were collected by the 1989
NASA ER2 arctic mission [3, 9]. We note that this data set has several shortcomings (which we discuss at
length in Section 3). Nevertheless we shall show that by using appropriate filtering techniques these data can
be used at least as a "testing ground" for the model presented in this paper.

The plan of the paper is as follows: In Section 2 we present the model for the computation of the flow
gradients and the local values of $C_T^2$ and Ri. In Section 3 we discuss the AASE data, its shortcomings and
its decomposition (using the Karhunen-Loève (K-L) algorithm [8, 19]) into mean, wave and turbulent components.
In particular we apply several tests to confirm that the "turbulent component" actually represents
turbulence. Using these turbulent residuals we then compute $C_T^2$ and Ri along the flight path of the ER2.

2  Computation  of  the  flow  gradients 

The statistical approach to turbulence modeling splits the flow variables $\tilde{\mathbf{U}}$, $\tilde{T}$ (where $\tilde{T}$ is the temperature)
into a sum

$$\tilde{\mathbf{U}} = \mathbf{U} + \mathbf{U}' + \mathbf{u}, \qquad \tilde{T} = T + T' + t \qquad (2.1)$$

where $\mathbf{U}$, $T$ represent the mean (large scale) flow, $\mathbf{U}'$, $T'$ represent waves and $\mathbf{u}$, $t$ "turbulent residuals" [16].
The turbulent residuals are then used to form the averaged Reynolds stress tensor


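$$\tau_{ij} = \overline{u_i u_j} \qquad (2.2)$$

(In the following we use $(x, y, z)$ and $(x_1, x_2, x_3)$ interchangeably to represent the spatial coordinates.)
The Boussinesq-Kolmogorov (B-K) postulate [7] relates $\tau_{ij}$ and $\overline{u_i t}$ to the gradients of the mean flow as follows:

$$\tau_{ij} = \frac{K}{3}\,\delta_{ij} - \nu_T \left( \frac{\partial U_i}{\partial x_j} + \frac{\partial U_j}{\partial x_i} \right) \qquad (2.3)$$

$$\overline{u_i t} = -k\, \nu_T\, \frac{\partial T}{\partial x_i} \qquad (2.4)$$

where

$$K = \tau_{ii} = \tau_{11} + \tau_{22} + \tau_{33}. \qquad (2.5)$$

In these equations $k$ is a constant and $\nu_T$ is the turbulent eddy viscosity.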

From the data collected by an airplane flying along a "straight horizontal" line in the stratosphere (which
we take as the x-axis) we can compute (after an appropriate decomposition of the data) the
six independent components of $\tau_{ij}$ and $\overline{u_i t}$. Eqs (2.3), (2.4) and the continuity equation for incompressible
flow

$$\frac{\partial U}{\partial x} + \frac{\partial V}{\partial y} + \frac{\partial W}{\partial z} = 0 \qquad (2.6)$$

can then be used (at least in principle) to evaluate $\nu_T$ and the ("missing") gradients that are needed to
compute $C_T^2$ and Ri.

To  see  how  this  is  done  we  first  write  eqs  (2.3)  explicitly; 


m  =  2z/t 


T22  —  2z/r 


© 


—  T33  =  2vt 


dy  ^  dx  ) 
^  dW" 
dz  dx  , 


(2.7b) 


(2.7d) 


(du 


T33  =  +  (2.7f) 

observe  that  the  averages  referred  to  in  these  equations  should  be  interpreted  as  local  averages  (in  a  “short 
window”)  around  the  point  x  under  consideration. 

Since  ^  is  known  we  can  use  (2.7a)  to  compute  vt-  Then  from  (2.7b),  (2.7d),  (2.7e)  we  compute  ^ 
and  The  continuity  eq.  (2.6)  and  eq.  (2.7c)  provide  then  two  methods  to  compute  and  thus  act  as 
a  check  for  the  validity  of  B-K  hypothesis  for  the  flow  under  consideration. 

Finally, eq. (2.7f) remains as one equation in two unknowns. To resolve this difficulty we shall assume

$$\frac{\partial W}{\partial y} = \frac{\partial W}{\partial x} \qquad (2.8)$$

(i.e. assume that the gradient of W in the x-y plane is isotropic) and this will allow us to compute an estimate for $\partial V/\partial z$. We note that this last quantity is utilized only in the last step of the computation of Ri.

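As to the temperature gradients we use (2.4) to obtain

$$\frac{\partial T}{\partial z} = \frac{\overline{wt}}{\overline{ut}}\,\frac{\partial T}{\partial x}. \qquad (2.9)$$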

The attempt to implement the algorithm described above numerically for actual geophysical flows yields, however, strong variations in the value of $\nu_T$ (and hence in the gradients). To overcome this difficulty we introduced a “more stable” numerical scheme to compute $\nu_T$. This is accomplished by using the Smagorinsky eddy viscosity model [8] where

$$\nu_T = \ell^2 \left(2 D_{ij} D_{ij}\right)^{1/2}, \qquad D_{ij} = \frac{1}{2}\left(\frac{\partial U_i}{\partial x_j} + \frac{\partial U_j}{\partial x_i}\right). \qquad (2.10)$$
To motivate this choice we observe that aircraft measurements of a flow at fixed intervals can be considered
as  “large  eddy  measurements”  as  no  information  is  available  about  the  flow  in  between  the  grid  points. 

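Eq. (2.10) has the advantage of utilizing all the components of $\tau_{ij}$ in the computation of $\nu_T$. In fact from eq. (2.3) we have

$$\left(\tau_{ij} - \frac{K}{3}\delta_{ij}\right)\left(\tau_{ij} - \frac{K}{3}\delta_{ij}\right) = 4\nu_T^2 D_{ij} D_{ij}. \qquad (2.11)$$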


Using  (2.10)  to  eliminate  (DijDij)  in  (2.11)  leads  then  to 


(2.12) 


In this equation $\ell$ is of the order of the “grid size” (i.e. the interval between measurements), which introduces a parameter in the computation of $\nu_T$. Using eq. (2.12) to compute $\nu_T$ we now have two ways to evaluate $\partial U/\partial x$: one directly from the data and the other from eq. (2.7a). Once again this can be used to verify the validity of the B-K hypothesis for the given data set.
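
The elimination step that leads to (2.12) can be spelled out as follows. This is a sketch under stated assumptions, not necessarily the authors' exact expression: it takes the B-K relation in the form $\tau_{ij} - (K/3)\delta_{ij} = -2\nu_T D_{ij}$, the Smagorinsky closure $\nu_T = \ell^2 (2D_{ij}D_{ij})^{1/2}$ with $\ell$ the length parameter of the order of the grid size, and introduces the shorthand $S$ (not in the original) for the squared deviatoric stress.

```latex
\begin{aligned}
S &\equiv \left(\tau_{ij}-\tfrac{K}{3}\delta_{ij}\right)\left(\tau_{ij}-\tfrac{K}{3}\delta_{ij}\right)
   = \tau_{ij}\tau_{ij}-\tfrac{K^{2}}{3}
   = 4\nu_T^{2}\,D_{ij}D_{ij},\\[4pt]
D_{ij}D_{ij} &= \frac{\nu_T^{2}}{2\ell^{4}}
   \qquad\text{(from } \nu_T=\ell^{2}\,(2D_{ij}D_{ij})^{1/2}\text{)},\\[4pt]
S &= \frac{2\nu_T^{4}}{\ell^{4}}
   \quad\Longrightarrow\quad
   \nu_T=\ell\left(\frac{S}{2}\right)^{1/4}.
\end{aligned}
```

Under these assumptions $\nu_T$ depends only on the measured stress components and on $\ell$, so no flow gradient enters its computation.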

Two strategies can be used now to compute $C_T^2$. The first one is to follow the Wyngaard-Kosovic model [1]. In this model

$$C_T^2 = a\,\frac{(\overline{w\theta})^2}{\tau\,\Lambda^{2/3}} \qquad (2.13)$$


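where $\tau$ is the local kinematic shear, $\Lambda$ is the local Monin-Obukhov length and $a$ is a numerical parameter. (Based on PBL data $a$ was set to 19 in [1].)

$$\tau^2 = \overline{uw}^{\,2} + \overline{vw}^{\,2} \qquad (2.14)$$

$$\Lambda = -\frac{\tau^{3/2}}{\beta\,(g/T)\,\overline{w\theta}} \qquad (2.15)$$

($\beta$ is the von Karman constant.) The only modification we applied to this scheme for stratospheric flow is to redefine $\tau$ to contain all the strain components, i.e.

$$\tau^2 = \overline{uw}^{\,2} + \overline{vw}^{\,2} + \overline{uv}^{\,2}. \qquad (2.16)$$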


We also determined $a$ so that the mean value of $C_T^2$ is equal to the value obtained from spectral estimates ($a \approx 0.1$).


It should be observed that in this scheme we do not utilize the flow gradients directly to compute $C_T^2$. However, the second algorithm to compute $C_T^2$, which is based on eqs. (1.2), (1.3), (1.4), requires these gradients.

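As to the computation of Ri we use eqs. (1.5), (2.9) but note that

$$\frac{\partial \theta}{\partial z} = \frac{\theta}{T}\left(\frac{\partial T}{\partial z} + \frac{g}{c_p}\right). \qquad (2.17)$$

In summary we presented here a model for the computation of $C_T^2$ and Ri from aircraft data. This model contains two parameters $a$ and $\ell$.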
3 A case study


Previous studies of the stratosphere and its structure parameters were conducted by many researchers, e.g. Lilly et al [5], Woodman and Rastogi [13], Fukao et al [14] and Dewan [12].

In this section however we restrict ourselves to one case study where we apply the model described above to compute the local values of $C_T^2$ and Ri in the stratosphere. To this end we use the data collected by the ER2 flight on Feb. 9, 1989 during the AASE mission to the Arctic. We note that the AASE data, as well as that obtained from later missions, was analyzed recently by Bacmeister et al [9]. (This paper gives a detailed description of this data and also contains a lengthy list of references which we shall not duplicate here.)

Two objections might be raised against the use of the AASE data in our context. First, there are concerns about its accuracy, especially for frequencies above 1 Hz [15]. Second, some previous studies [5] concluded that the outer scale of turbulence in the stratosphere must be of the order of 10-50 m, while the resolvable scale in the AASE data is 80 m.

Our  answer  to  the  first  objection  is  that  our  results  in  this  case  study  should  be  viewed  as  preliminary 
(and  we  can  only  hope  that  better  data  will  improve  our  results).  To  overcome  the  second  objection  (at  least 
partially)  we  shall  apply  the  K-L  decomposition  [18,  19]  to  the  data  and  demonstrate  that  this  technique 
can  be  used  to  separate  it  into  mean,  wave  and  turbulent  residuals.  Several  tests  [10,  11]  are  then  used  to 
verify  that  the  turbulent  residuals  so  obtained  actually  represent  turbulence  in  the  stratosphere. 

3.1  K-L  decomposition 

The K-L decomposition algorithm (or PCA) has been used by many researchers (for a review see [19]). Here we shall give only a brief overview within our context.

Let a time series X (of length N) of some geophysical variable be given. We first determine a time delay $\tau$ for which the points in the series are decorrelated. Using $\tau$ we create n copies of the original series

$$X(k),\ X(k+\tau),\ \ldots,\ X(k+(n-1)\tau).$$

(To create these one either uses periodicity or chooses to consider shorter time series.) Then one computes the auto-covariance matrix $R = (R_{ij})$

$$R_{ij} = \sum_{k=1}^{N} X(k+i\tau)\,X(k+j\tau). \qquad (3.1)$$

Let $\lambda_0 > \lambda_1 > \ldots > \lambda_{n-1}$ be the eigenvalues of R with their corresponding eigenvectors $\phi^i$, $i = 0,\ldots,n-1$.

The original time series X can then be reconstructed as

n—l 

Xij)  =  '£^k{M  (3-2) 

*!=0 




where 


1 

n  ■  (3-3) 

^  *=0 

The essence of the K-L decomposition is based on the recognition that if a large spectral gap exists after the first $m_1$ eigenvalues of R then one can reconstruct the mean flow (or the large component) of the data by using only the first $m_1$ eigenfunctions in (3.2). A recent refinement of this procedure due to Ghil et al [18] is that the data corresponding to eigenvalues between $m_1 + 1$ and up to the point $m_2$ where they start to form a “continuum” represent waves.

Thus the original data can be decomposed into mean flow, waves and residuals (i.e. data corresponding to eigenvalues $m_2 + 1, \ldots, n - 1$ which we wish to interpret at least partly as turbulent residuals).
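
The three-way split described above can be sketched numerically. The following is a minimal illustration, not the authors' code: the function name `kl_decompose`, its argument names, the 0-based grouping of components (the first `m1` to the mean, the next `m2 - m1` to the waves, the rest to the residuals), the unnormalised lag matrix and the synthetic test series are all assumptions made for the example. It uses the "shorter time series" option for building the delayed copies.

```python
import numpy as np


def kl_decompose(x, n, tau, m1, m2):
    """Split a series into mean, wave and residual parts via delayed copies.

    Assumed convention (0-based): components 0..m1-1 -> mean,
    m1..m2-1 -> waves, m2..n-1 -> residuals.
    """
    x = np.asarray(x, dtype=float)
    N = len(x) - (n - 1) * tau            # usable length of each delayed copy
    if N <= 0:
        raise ValueError("series too short for n copies with delay tau")
    # Row i holds the copy X(k + i*tau), k = 0..N-1.
    Y = np.stack([x[i * tau: i * tau + N] for i in range(n)])
    R = Y @ Y.T                           # lag matrix, cf. eq. (3.1)
    lam, phi = np.linalg.eigh(R)          # eigh returns ascending eigenvalues
    order = np.argsort(lam)[::-1]         # reorder to descending
    lam, phi = lam[order], phi[:, order]
    A = phi.T @ Y                         # expansion coefficients per component

    def rebuild(lo, hi):
        # Row 0 of the partial reconstruction is the undelayed series.
        return (phi[:, lo:hi] @ A[lo:hi, :])[0]

    return lam, rebuild(0, m1), rebuild(m1, m2), rebuild(m2, n)


# Synthetic example: slow trend + oscillation + noise (illustrative only).
rng = np.random.default_rng(0)
t = np.arange(400)
series = 0.01 * t + np.sin(2 * np.pi * t / 25) + 0.1 * rng.standard_normal(t.size)
lam, mean, wave, resid = kl_decompose(series, n=8, tau=3, m1=2, m2=4)
```

Because the eigenvectors form an orthonormal basis, the three parts add back up to the (shortened) input series; choosing `m1` and `m2` from the gaps in the eigenvalue spectrum `lam` is the step that requires judgement.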

For the data under consideration we carried out this decomposition using a delay $\tau$ of 1024 points (approximately 40 km separation) for all the geophysical variables. The resulting eigenvalues and data decomposition for the temperature are presented in Figs. 1-4. Figs. 5, 6 present the Fourier transform of the wave and turbulent components. In this decomposition n = 96, $m_1$ = 6 and $m_2$ = 14. Similar decompositions were carried out for the components of the velocity field (and they confirm the validity of the B-K hypothesis for stratospheric flow).

3.2  The  interpretation  of  the  residuals 

The  residuals  of  the  time  series  which  are  reconstructed  as 

n-l 

fc=m2+l 

contain (obviously) the measurement errors in the data. However we shall show that primarily they should be interpreted as representing turbulence. To this end we utilize the tests devised by Axford [10] and Dewan [11]. According to these tests turbulence data is characterized by low coherence between u, v, w and a phase close to zero or $\pi$ between w and $\theta$. (A phase close to $\pi/2$ is characteristic of waves.) Figs. 7, 8, 9 demonstrate that the coherence between the residuals of u, v, w is less than 0.1. Fig. 10 gives a scatter plot of the phase between w and $\theta$. This figure is less definitive as there are still quite a few points in the wave sector $(\pi/4, 3\pi/4)$. However, out of the 150 points in this plot 105 are in the “turbulence sector”.

These  tests  show  that  to  a  large  extent  the  residuals  that  were  obtained  from  the  K-L  decomposition 
represent  actual  3-D  turbulence. 
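
The phase test can be made concrete with a small counting routine. This is only a sketch: the function name `turbulence_fraction`, the sector limits $\pi/4$ and $3\pi/4$ used as defaults, and the sample phase values are assumptions for illustration, not data from the flight.

```python
import numpy as np


def turbulence_fraction(phase, lo=np.pi / 4, hi=3 * np.pi / 4):
    """Fraction of phase values (radians) lying outside the assumed wave sector.

    Each phase is wrapped to (-pi, pi] and folded to [0, pi]; a point is
    counted as wave-like when the folded value lies strictly between lo and hi
    (i.e. near pi/2), and as turbulence-like otherwise (near 0 or pi).
    """
    wrapped = np.angle(np.exp(1j * np.asarray(phase, dtype=float)))
    folded = np.abs(wrapped)
    wave_like = (folded > lo) & (folded < hi)
    return 1.0 - wave_like.mean()


# Hypothetical phases between w and theta at ten frequencies.
phases = np.array([0.05, 3.0, -3.1, 1.5, -1.6, 0.3, 2.9, 1.0, 0.1, -0.2])
frac = turbulence_fraction(phases)
```

For comparison, the count quoted in the text (105 of 150 points in the turbulence sector) corresponds to a fraction of 0.7.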

3.3 Computation of $C_T^2$, Ri

Using the residuals obtained above and by applying the algorithm described in Section 2 we can now compute $C_T^2$ and Ri along the flight path of the ER2. The results are shown in Figs. 11, 12. It is evident from these figures that there exist large local variations in the values of $C_T^2$ and Ri in the stratosphere. These variations are totally masked when an averaged value for these parameters is computed from the whole time-series.
(These  averaged  values  are  =  0.3.10”^,  Ri  =  0.14).  These  results  confirm  previous  studies  [12,  17] 
which  suggested  that  the  flow  in  the  stratosphere  is  strongly  stably  stratified  with  intermittent  thin  patches 
of  3-D  turbulence. 
Abstract


Space-based  inflatable  technology  is  of  current  interest  to  NASA  and  DOD,  and  in 
particular  to  the  Air  Force  and  Phillips  Laboratory.  Potentially  large  gains  in  lower  launch  costs, 
through  reductions  in  structure  mass  and  volume,  are  driving  this  activity.  Diverse  groups  are 
researching  and  developing  this  technology  for  radio  and  radar  antennae,  optical  telescopes,  and 
solar  power  and  propulsion  applications.  Regardless  of  the  use,  one  common  requirement  for 
successful  application  is  the  accuracy  of  the  inflated  surface  shape.  The  work  reported  here 
concerns  a  preliminary  nonlinear  finite  element  and  experimental  analysis  of  shape  control  of  an 
inflated  thin  circular  disk.  Shape  modification  was  achieved  through  enforced  boundary 
displacements,  which  resulted  in  moving  the  inflated  shape  towards  a  desired  parabolic  profile. 
Conclusions  and  recommendations  are  provided. 
SHAPE CONTROL OF AN INFLATED THIN CIRCULAR DISK:

Experimental  Investigation 

Christopher  H.  M.  Jenkins 

1.  INTRODUCTION 

There  currently  exists  renewed  interest  in  applications  of  inflatable  structures  in  space. 
With  increased  pressure  to  reduce  costs  associated  with  design,  fabrication,  and  launch  of  space 
structures,  DOD  and  NASA  are  taking  a  new  look  at  space-based  inflatable  structures. 
Applications  for  inflatable  structures  in  space  include  lunar  and  planetary  habitat,  RF  reflectors 
and  waveguides,  optical  and  IR  imaging,  solar  concentrators  for  solar  power  and  propulsion,  sun 
shades,  and  solar  sails  [see,  e.g.,  Grossman  and  Williams,  1990;  Cassapakis  and  Thomas,  1995]. 

For  many  of  these  applications,  particularly  those  involved  with  communications, 
imaging,  power,  and  propulsion,  accurate  maintenance  of  the  inflatable  surface  shape  is  critical. 
Surface  accuracy  ranges  from  a  few  percent  of  wavelength  for  low  frequency  applications,  to  a 
fraction  of  a  percent  of  wavelength  for  visual  wavelengths  [see  Hedgepeth,  1982].  The 
likelihood  of  achieving  such  tolerances  by  purely  passive  means  is  small. 

For example, the AF Phillips Laboratory has recently undertaken the task of creating a large optical quality membrane telescope. The membranes on these telescopes will range in thickness from 10 to 150 micrometers. The maximum acceptable peak-to-peak figure error over the entire surface will range from 10 to 20 micrometers. (This value assumes that a certain amount of secondary adaptive optics will be used to correct image errors.) Passive and active means for reducing the figure error are under investigation.

In  what  follows,  background  information  on  highly  compliant  structures  and  their 
analysis  is  first  given.  Then  we  review  the  problem  of  the  inflation  of  an  initially  plane,  circular 
membrane,  with  deformations  limited  to  those  that  admit  only  small  strains  but  moderately  large 
rotations.  (We  purposefully  exclude  here  any  consideration  of  the  associated  problems  of  the 
inflation  of  annular  membranes  or  circular  plates,  or  of  large  strain  deformation;  see  Jenkins  and 
Leonard  [1991],  and  Jenkins  [1996a]  for  additional  details  on  these  topics.)  Next  we  discuss  the 
use  of  finite  element  analysis  to  demonstrate  the  effects  of  boundary  (rim)  control  on  the  deformed 
shape of an inflated circular membrane. Experimental work is then described which correlates with
the  FEM  analysis.  Results  are  presented,  and  finally  conclusions  are  provided. 

Background 

Structures  formed  from  thin-walled  material  can  be  found  in  a  diverse  array  of 
applications,  from  biological  organisms  to  architectural  structures  to  aerospace  craft.  Depending 
on  the  degree  of  bending  resistance  inherent  in  such  structures,  as  well  as  the  degree  of 
participation of in-plane and bending reactions to loads, for certain analytical purposes they may
be  modeled  successfully  as  membranes,  whose  unique  feature  is  the  absence  of  any  resistance  to 
bending. 

For  structures  whose  bending  resistance  is  very  small  indeed  (and  we  take  here  as 
fundamental  that  all  real  structures  have  some  finite  bending  resistance,  no  matter  how  small), 
compressive  stresses  result  in  a  localized  buckling  or  wrinkling.  Hence,  a  further  modeling 
restriction  can  be  made  that  limits  the  membrane  to  allowable  stresses  greater  than  zero  only,  i.e., 
a  no-compression  membrane.  The  structure  is  then  modeled  as  carrying  loads  only  in  tension  or 
in  tension  fields  [see  Jenkins  and  Leonard,  1991;  Jenkins,  1996a]. 

Finite  element  methods  have  made  significant  contributions  to  the  analysis  of  large 
deformation,  nonlinear  structural  problems.  Yet  the  analysis  of  membrane  structures  that  undergo 
large  displacements  during  loading  remains  a  difficult  problem  for  which  current  finite  element 
methods  are  unsuitable.  The  primary  difficulties  lie  in  the  fact  that  the  structural  system  is 
underconstrained,  and  stable  equilibrium  conditions  only  exist  for  loading  fields  that  are  orthogonal 
to  the  set  of  unconstrained  degrees  of  freedom  (see  Kuznetsov,  1991).  Furthermore,  configurations 
exist  that  are  not  in  the  vicinity  of  such  an  equilibrium  state.  Consequently,  large  rigid  body 
motions  with  concomitant,  large  configuration  changes,  must  take  place  before  elastic  response  is 
obtained. 

A  class  of  pneumatic  envelopes  provides  dramatic  examples  of  underconstrained  systems: 
the  stowed  versus  deployed  configurations  in  air  bags,  balloons,  parachutes,  or  reflectors.  There  are 
several  difficult  aspects  of  the  analysis  of  pneumatic  envelopes.  First  of  all,  while  structural 
stiffness  is  provided  by  structural  components  such  as  the  skin,  and  any  reinforcing  elements  such 
as cables or tapes, this stiffness can only be engaged in specific loaded configurations, and the
stability of the structural system only exists under such loading. The structural system is thus said
to  be  underconstrained. 

Furthermore,  the  unloaded  configuration  may  be  far  from  any  static  equilibrium  condition 
in solution space. Constraints are usually unilateral so that a structurally impossible state is
obtained  under  a  reversal  of  the  load  from  an  equilibrium  state. 

Even  in  a  loaded  configuration,  the  structural  function  of  the  skin  may  degenerate  from  a 
state  of  bi-axial  tension,  for  which  the  membrane  model  is  appropriate,  to  a  state  of  uni-axial 
tension  (since  compression  cannot  be  accommodated  by  thin  films  or  cables).  Under  this  latter 
condition  the  skin  will  wrinkle.  This  condition  may  be  appropriately  modeled  by  a  tension  field 
model which ignores the detail of the wrinkling but properly models the structural performance of
the  skin  [Jenkins  and  Leonard,  1993].  Experimental  verification  is  a  challenge  under  such 
conditions,  since  any  contact  with  the  structure  readily  disturbs  that  which  was  to  be  measured 
[Jenkins,  1995;  Jenkins,  et  al.,  1995;  Jenkins,  1996b]. 

2.  MECHANICS  OF  INFLATED  MEMBRANES 

The  problem  of  the  inflation  of  an  initially  plane  membrane  with  circular  boundary  begins 
with Hencky [1915]. Earlier, Foppl [1907] had arrived at equilibrium equations for a membrane
plate.  These  equations  were  essentially  modified  von  Karman  plate  equations  [1910]  with  the 
bending  rigidity  set  to  zero.  Figure  2-1  represents  a  plane  elastic  sheet  with  circular  boundary  of 
radius  a.  The  geometry  is  rotationally  symmetric  (axisymmetric);  we  also  assume  the  loading  to  be 
a  symmetric  pressure  p.  The  sheet  has  modulus  E,  thickness  h,  and  Poisson's  ratio  v. 
Figure 2-1. Definition Sketch.


The von Karman plate equations are based upon strains which have nonlinearity introduced through rotations (squares of slope):

$$\varepsilon_r = \frac{du}{dr} + \frac{1}{2}\left(\frac{dw}{dr}\right)^2 \qquad (1)$$

$$\varepsilon_\theta = \frac{u}{r}$$

where  u  and  w  are  displacements  in  the  r  and  z  directions,  respectively. 
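
To illustrate how the slope-squared term enters, the strains can be evaluated numerically from sampled displacement profiles. This is a minimal sketch: the function name `membrane_strains`, the symbols `eps_r` and `eps_theta`, the parabolic trial shape and the numerical values of `a` and `w0` are illustrative assumptions, not results from the analysis.

```python
import numpy as np


def membrane_strains(r, u, w):
    """Radial and circumferential strains from sampled u(r), w(r), with r > 0."""
    r, u, w = (np.asarray(v, dtype=float) for v in (r, u, w))
    du_dr = np.gradient(u, r, edge_order=2)
    dw_dr = np.gradient(w, r, edge_order=2)
    eps_r = du_dr + 0.5 * dw_dr ** 2      # nonlinear (rotation) contribution
    eps_theta = u / r
    return eps_r, eps_theta


# Illustrative values only: radius a and centre deflection w0 in inches.
a, w0 = 8.6, 0.4733
r = np.linspace(0.1, a, 50)
w = w0 * (1.0 - (r / a) ** 2)             # parabolic trial shape
u = np.zeros_like(r)                      # no in-plane displacement
eps_r, eps_theta = membrane_strains(r, u, w)
```

With zero in-plane displacement the radial strain is entirely due to the rotation term, here $2 w_0^2 r^2 / a^4$, which shows why a purely transverse deflection still stretches the membrane.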

The von Karman plate equations in axisymmetric form can be written as:

where $D = Eh^3/[12(1-\nu^2)]$ is the bending rigidity, and $\Phi$ is a stress function to which the stress resultants in the radial and circumferential directions are related, respectively, by:

$$N_r = \frac{1}{r}\frac{d\Phi}{dr}, \qquad N_\theta = \frac{d^2\Phi}{dr^2} = \frac{d}{dr}\left(r N_r\right).$$

Linear  elastic  constitutive  relations  are  assumed.  It  is  important  to  note  that  the  above  equations  are 
not  geometrically  exact,  in  that  several  assumptions  have  been  made  with  regard  to  the  geometry  of 
the  deformed  configuration. 

Hencky  provided  a  solution  of  the  above  for  the  case  of  zero  boundary  displacements  (u  =  0 
=  w  at  r  =  a)  and  D  =  0  by  assuming  power  series  for  stresses  and  transverse  displacement.  Little 
interest  in  the  Hencky  problem  seems  to  have  taken  place  until  the  1940's,  when,  in  conjunction 
with  a  study  on  air-supported  roofs,  Stevens  [1944]  performed  an  experimental  investigation  on  a 
10  inch  radius  by  0.014  inch  thick  cellulose  acetate  butyrate  inflated  circular  membrane.  Stevens 
compared  his  results  for  the  deflected  membrane  shape  (which  were  apparently  empirical  in  nature) 
with  Hencky’s.  Apparently  Chien  [1948]  provided  slight  corrections  to  Hencky's  solutions  for 
maximum  values  of  stress  and  transverse  deflection  [see  also  Kao  and  Perrone,  1971]. 

Campbell [1956] allowed an arbitrary initial tension in the membrane. Dickey [1967] re-examined the Hencky problem, this time using a nonlinear integral equation formulation. Dickey provided a plot of u(r) for various values of the Poisson's ratio. Weil and Newmark [1955] provide some experimental evidence that validates Dickey's predictions. We discuss later the important relationship of u(r) with the surface precision.

Kao  and  Perrone  [1971]  used  a  relaxation  method,  developed  earlier  by  Shaw  and  Perrone 
[1954],  to  solve  the  Hencky  problem  [see  also  Kao  and  Perrone,  1972].  Schmidt  and  DaDeppo 
[1974]  and  Schmidt  [1974]  used  a  perturbation  analysis  based  on  odd  powers  of  r  (actually  r/a),  in 
conjunction  with  the  Marguerre  shallow  shell  equations  (which  collapse  to  the  von  Karman  plate 
equations  when  there  exists  no  initial  curvature),  to  determine  the  maximum  values  of  the  stress  and 
transverse  displacement  in  the  Hencky  problem.  In  a  series  of  articles,  Storakers  [1983]  uses  a 
power series in even powers of r (or r/a) for stresses and transverse displacement (odd powers of r
for  u(r)). 

Validity of the Foppl-von Karman formulation for certain regimes of deformation is discussed by Weinitschke [1980]. For the case of the axisymmetric deformation of an annular membrane, Weinitschke considered a parameter k (essentially the so-called nondimensional load parameter), where


1 


and  r^  is  the  radius  of  the  outer  edge  of  the  membrane.  Comparisons  were  made  between  the  Foppl 
and the large rotation Reissner shell (with zero bending stiffness) theories. For values of k > 1/2, the difference between theories exceeds 10%. The appropriateness of a Reissner-like theory for k > 1/2 is reiterated in Weinitschke [1987]. [See also Storakers (1983); for further details on the Reissner theory see Weinitschke (1989); see also Ciarlet (1980) for further justification of the von Karman equations.] Pujara and Lardner [1978] showed that linear (Hookean) and nonlinear (Mooney-Rivlin) elastic constitutive relations provide essentially similar Hencky results up to $w_0/a$ equal to about 0.3.

3.  NUMERICAL  ANALYSIS 

It  has  been  observed  in  inflatable  reflectors,  that  their  service  shape  deviates  from  the 
desired  parabolic  shape  as  a  function  of  position  on  the  reflector.  This  leads  to  the  so-called  “W- 
curve”  discussed  in  more  detail  below.  Hence,  one  fundamental  motivation  for  shape  control  of 
a  circular  disk  is  to  minimize  the  surface  deviation  from  the  desired  target  parabola  [Jenkins  et  al. 
(1998a,  1998b);  Jenkins  and  Marker  (1997);  Marker  and  Jenkins  (1997)]. 
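
The surface deviation from a target parabola can be quantified as in the sketch below. The choice of reference parabola (sharing the centre deflection and the zero rim deflection of the profile), the function name `deviation_from_parabola` and the quartic test profile are assumptions made for illustration; they are not the profiles analysed in this report.

```python
import numpy as np


def deviation_from_parabola(r, w, a):
    """Deviation of a profile w(r) from the parabola with the same centre
    deflection and zero deflection at the rim r = a."""
    r = np.asarray(r, dtype=float)
    w = np.asarray(w, dtype=float)
    w0 = w[0]                              # assumes r[0] == 0 (disk centre)
    target = w0 * (1.0 - (r / a) ** 2)
    dev = w - target
    rms = np.sqrt(np.mean(dev ** 2))
    return dev, rms


a = 8.6                                    # rim radius, inches
r = np.linspace(0.0, a, 87)
x = r / a
# Hypothetical non-parabolic profile: parabola plus a small quartic term.
w = 0.4733 * (1.0 - 0.9 * x ** 2 - 0.1 * x ** 4)
dev, rms = deviation_from_parabola(r, w, a)
```

For this test profile the deviation vanishes at the centre and at the rim and peaks at mid-radius, which is the kind of position-dependent error that rim control is meant to reduce.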

It was seen above that the initially plane disk does not have an inflated shape that is parabolic. The question naturally arises of how one could move the shape to be more parabolic. More generally, the following question might be posed: Given “N” actuators equally spaced around the rim of an inflated disk (or disk pair), what combination of actuations would result in optimizing a desired effect at some location on the disk? Below we provide some results from an initial investigation of that question.
Model and ABAQUS

One  quarter  of  the  balloon  upper  surface  was  modeled  because  of  symmetry  of  the 
structure.  Model  construction,  analysis,  and  post  processing  were  conducted  using  the  nonlinear 
FEM  code  ABAQUS.  The  model  was  situated  in  the  first  quadrant  of  the  xy  plane,  and  therefore 
x and y symmetry boundary conditions were applied, as shown in Figure 3-1, noting that
coordinates  x,  y  and  z  are  represented  by  1,  2  and  3,  respectively,  in  ABAQUS. 


Figure  3-1  FEM  model  of  one  quarter  of  a  balloon  upper  surface,  showing  mesh,  boundary 
conditions  and  prestresses  at  its  rim,  noting  that  the  node  number  at  the  center  (lower-left  corner) 
is  100. 

As pressure was applied, large deflection occurred in the 3-direction. The model consisted of 48 second-order membrane elements and 157 nodes. For convenience, the model edge along the 1-direction was also called the horizontal line (abbreviated as horiln). Lines oriented at 22.5 and 45 degrees with respect to the 1-direction were named line 225 (ln225) and middle line (midln), respectively.

Linear  elastic  properties  were  assumed  in  the  present  modeling:  Young’s  modulus  E  = 
200  ksi,  Poisson’s  ratio  v  =  0.4.  The  geometric  dimensions  of  the  model  were:  membrane 
thickness  t  =  0.00125  inch,  and  radius  a  =  8.60  inch.  The  pressure  used  was  0.02  psi. 

Analysis  procedures 

The analysis was completed in 4 steps that replicated what would take place experimentally. From step 1 to step 3, inflation of the balloon was completed. A deflected meridian curve was obtained, which provided a basis for comparison with those later obtained under so-called rim control, i.e., imposed boundary displacements.

1)  Step  1 

Prestress was first applied to the surface with a load of 0.031 lb at each respective point of the rim, as shown in Figure 3-1, which resulted in a Mises stress in the membrane of 14 to 18 psi (Figure 3-2) and a rim outward displacement of 0.0005 inch (Figure 3-3).

Figure 3-2 The stress in the membrane was around 14 to 18 psi after prestressing.

Figure 3-3 A rim displacement of 0.00051 inch was achieved after prestressing.


2)  Step  2 

The rim displacement resulting from prestressing was fixed. Spring elements were used to initiate deflection of membrane elements under a pressure of a small fraction of 0.02 psi.

3)  Step  3 

Spring  elements  were  removed  from  the  model  and  a  full  pressure  of  0.02  psi  was 
applied.  A  maximum  deflection  of  0.4733  inch  resulted  at  the  center  (Figure  3-4),  and  Mises 
stress  in  the  membrane  of  482  to  701  psi  with  the  maximum  located  at  the  center  of  the  surface 
(Figure  3-5). 
Figure 3-4 A maximum deflection of 0.4733 inch at the center was achieved under a pressure of 0.02 psi, noting that boundary conditions and prestress are still shown.

Figure 3-5 Mises stress was around 480 to 700 psi under a pressure of 0.02 psi.


4)  Step  4 

Outward radial displacements of 0.05 inch were imposed at certain positions on the model rim (radius = 8.6 inches). Detailed results will be described below.


4.  EXPERIMENTAL  ANALYSIS 

An  experiment  was  conducted  to  verify  the  FEM  model  described  above.  Funding 
limited  the  sophistication  of  the  experiment  performed;  however,  results  gathered  confirm  the 
model  and  provide  a  foundation  for  future  experimental  work.  The  experimental  setup  consisted 
of  a  mylar  balloon  and  the  following  subsystems  (see  Figure  4-1): 

Support  frame  and  rings 

This system provides support to the balloon while allowing mount locations for tension wires. The frame supports two concentric rings, which are attached together and pivot within the frame, thus allowing both rings to be in either horizontal or vertical positions. The clamping devices are mounted on the outer ring, while the inner ring supports the tensioning cable and the balloon.

Tensioning  system 


Figure 4-1. Schematic of experimental setup. A = Capacitance sensor; B = Robot; C = Tensioning system; D = Support system; E = Pressure system.

The  tensioning  system  is  necessary  to  apply  some  prestress  in  the  balloon  before 
inflation,  and  also  for  creating  the  required  boundary  displacements.  A  "weight  and  pulley" 
tensioning  system  has  been  chosen  for  its  ease  of  use  and  versatility.  It  consists  of  a  smooth 
inner  ring  surrounding  the  balloon,  which  acts  as  a  "pulley"  to  the  weighted  tension  lines 
attached directly to the boundary of the balloon. Braided Kevlar line is used for the tensioning lines, which are radially attached to the balloon at a spacing of 11.25° (see Fig. 4-2). With the rings in a horizontal position, a weight of 22.97 g (0.0615 lb) is attached to the other end of each line, after it passes through the clamping device.


Figure 4-2. Tensioning system schematic (vertical position)

These lines can then be clamped off in order to hold them at constant tension as the balloon is rotated to its mapping (vertical) position. The clamping devices used are much like cable clamps, and secure the tension lines while the balloon is in the vertical orientation. A schematic of this setup can be seen in the figure below.
Figure 4-3. Clamping mechanism (labels: line around tension ring to balloon; inner tension ring)


The line travels over the tension ring, then down between the two washers and through the bolt. To create the boundary displacement, there is a provision for pulling along the 0, 45 and 90-degree lines around the balloon, through use of a turnbuckle system.

Pressure  control 

Maintaining  a  constant  pressure  during  the  surface  mapping  is  an  essential  requirement 
for  this  experiment.  The  pressure  control  system  consists  of  a  hand  regulator/bleed  valve 
("dynamic  system"),  which  produces  the  required  accuracy.  The  system  also  includes  a  valve 
closer  to  the  balloon  to  allow  for  measuring  the  pressure  inside  the  balloon  at  static  conditions. 
Both the dynamic and static systems were verified using a U-tube manometer with a resolution of 1/10th of an inch H2O (0.0036 psi) to check the specified input pressure. To achieve a certain pressure, the pressure regulator is adjusted until the desired pressure is reached, as shown by the U-tube, while the bleed valve is opened to allow for air flow. The combination of the pressure regulator and bleed valve has been verified to provide a constant pressure over a time period of 1 hour. The second valve is also used to check the pressure inside the balloon, simply by closing this valve to obtain a static pressure condition.
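
The manometer resolution can be put in context with a short unit conversion. This is an illustrative calculation only: the conversion constants are conventional values, the helper names are hypothetical, and the 0.02 psi target is the pressure used in the FEM model, assumed here to be the pressure set in the experiment as well.

```python
PSI_IN_PA = 6894.757        # pascals per psi
INH2O_IN_PA = 249.0889      # pascals per inch of water (conventional value)


def psi_to_inh2o(p_psi):
    """Convert a pressure from psi to inches of water."""
    return p_psi * PSI_IN_PA / INH2O_IN_PA


def inh2o_to_psi(h_in):
    """Convert a pressure from inches of water to psi."""
    return h_in * INH2O_IN_PA / PSI_IN_PA


resolution_psi = inh2o_to_psi(0.1)      # manometer resolution, 1/10 inch H2O
target_inh2o = psi_to_inh2o(0.02)       # assumed inflation pressure, 0.02 psi
steps = 0.02 / resolution_psi           # target pressure in resolution steps
```

Under these assumptions the target pressure corresponds to a little over half an inch of water, that is, between five and six resolution steps of the U-tube.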


Robot  measurement  system 

The  profile  mapping  system  consisted  of  an  industrial  robot,  controlled  by  a  personal 
computer, and a Capacitec sensor. This robotic mechanism is used under closed-loop control, with
two degrees of freedom that are accurate to 0.00039 in (0.01 mm).

The  heart  of  the  measurement  system  is  the  capacitance  displacement  sensor  (Capacitec 
model  HPT-75).  This  cylindrically-shaped  non-contact  sensor,  of  1.9  mm  (0.075  in)  core 
diameter, acts as one-half of a capacitor, while the membrane surface (conductive mylar) acts as
the other half. An AC voltage is applied between the sensor core and the membrane surface, and
a self-contained single channel amplifier card within a PC measures the capacitance of this
sensor/membrane  circuit.  A  scaled  DC  voltage  is  returned  to  an  output  terminal  on  the  card,  to 
be  read  for  data  acquisition.  The  sensor  has  linearity  and  repeatability  of  ±  0.2%  and  ±  0.01%  of 
full  scale,  respectively,  and  a  maximum  range  of  about  3  mm  (0.12  in). 

Data  acquisition  system 

The  principle  of  the  Capacitec  sensor  is  to  measure  the  voltage  corresponding  to  the 
sensor's  distance  from  the  surface.  In  this  experiment,  the  goal  was  to  achieve  a  fixed  voltage  for 
different  measuring  positions  around  the  balloon  surface,  thus  mapping  the  balloon  profile  along 
a  meridian. 

To collect the data, the LabVIEW (data acquisition) and AML (robot control) software
packages were used. The AML software was used to control the robot movement directly from
the computer. The robot is moved in the X and Z directions by entering the desired coordinates
in the program, which positions the robot with a precision of 0.01 mm.

The  sensor  is  connected  to  the  robot  arm  through  the  hole  in  a  plate  at  the  end  of  the  arm. 
Two  nuts,  one  on  each  side  of  the  plate,  securely  mount  the  sensor  to  the  robot.  The  ground  line 
is  clamped  to  the  balloon.  Dan.vi,  a  program  under  LabVIEW  software,  gives  the  voltage 
reading  of  the  sensor  in  both  graphical  and  numerical  forms. 

After  the  balloon  was  completely  inflated,  the  robot  was  moved  toward  one  edge  of  the 
balloon,  and  the  voltage  reading  from  the  sensor  was  observed.  In  this  experiment  6  volts  was 
taken  as  the  reference  voltage,  and  at  that  position  the  X  and  Z  coordinates  were  recorded.  In  a 
similar way, the X and Z coordinates of the diametrically opposite edge were also recorded. The
Z coordinate should be the same for both edge positions, to make sure that the balloon surface is
parallel to the sensor axis. From these two positions, the center X coordinate of the balloon is
the midpoint of the two edge X coordinates. To map the half-profile of the balloon, 15 points
were taken from center to edge, and the Z coordinate for every point was measured at the
reference voltage. These data are analyzed to get the surface profile of the balloon.
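
The centering step described above can be sketched in a few lines of Python. This is only an illustration: the function name, the edge readings, and the assumption that the 15 stations are equally spaced are hypothetical and are not taken from the experiment.

```python
def half_profile_stations(x_edge_a, x_edge_b, n_points=15):
    """Return the center X coordinate and n_points X stations from center to edge.

    Assumes (hypothetically) that the stations are equally spaced.
    """
    x_center = 0.5 * (x_edge_a + x_edge_b)      # midpoint of the two edge readings
    radius = 0.5 * abs(x_edge_b - x_edge_a)     # half the edge-to-edge distance
    step = radius / (n_points - 1)
    stations = [x_center + i * step for i in range(n_points)]
    return x_center, stations


# Hypothetical edge readings in mm (not measured values from the experiment).
center, xs = half_profile_stations(100.0, 536.88)
print(center, xs[0], xs[-1])
```

At each station the robot would then be driven in Z until the sensor returns the reference voltage, and that Z value recorded.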

Experimental  Procedure 

The  first  step  of  the  experiment  is  to  mount  the  balloon.  To  attach  the  tensioning  lines  to 
the balloon, a template was made so that the lines would lie in radial directions. Index tabs were
used to attach the strings to the balloon, and Kevlar line was used. After the balloon was
prepared,  it  was  mounted  on  the  support  frame  in  horizontal  position,  with  a  uniform  tension  load 
at  the  other  end  of  each  string.  Then  the  tension  lines  were  clamped  in  the  frame.  The  clamps 
were  tightened  a  little  more  than  finger  tight  in  order  to  get  some  prestress  in  the  balloon. 

The  second  step  is  the  inflation  process.  The  balloon  was  inflated  to  a  desired  pressure 
by adjusting the regulator. The bleed valve was opened to maintain constant pressure inside the
balloon. At this point, the entire system was left for 1/2 hour to allow the balloon to reach the
desired pressure, and to observe any pressure difference.

Third,  the  measuring  device,  i.e.,  the  personal  computer,  Capacitec  sensor,  and  the 
industrial  robot  were  prepared.  When  ready,  the  frame  was  flipped  to  the  mapping  (vertical) 
position,  and  again  the  pressure  inside  the  balloon  was  checked. 

Finally, the robotic arm with the sensor was brought toward the balloon as discussed
previously, and the mapping was begun. These data were used to analyze the balloon profile and
its deviation from the parabola with inflation only. After these measurements were done, the
balloon was flipped to the horizontal position to apply the boundary displacements along the 0,
45, and 90-degree lines. Then the balloon was flipped to the vertical position again to measure
the surface resulting from the boundary displacements. The entire process was done with the
balloon inflated at the same pressure. The data were then analyzed to see the effect of boundary
displacements on the surface profile.

5.  RESULTS  AND  DISCUSSION

FEM deflection data before and after imposing the boundary displacements are listed in Table 5-1.

Radius    w*          w0**        w3***       w3(ln225)
0         0.4733      0.1521      0.4095      0.4095
0.75      0.47        0.151       0.4066      0.4066
1.50      0.4603      0.1475      0.3978      0.3978
2.25      0.4439      0.1417      0.383       0.383
3.00      0.4208      0.1337      0.3621      0.3622
3.75      0.3908      0.1233      0.3351      0.3355
4.50      0.3535      0.1106      0.3009      0.3034
5.25      0.3087      9.56E-02    0.26        0.2642
6.00      0.2558      7.82E-02    0.2087      0.2215
6.75      0.1944      5.85E-02    0.1401      0.181
7.50      0.1236      3.65E-02    6.51E-02    0.1353
8.05      6.49E-02    1.89E-02    2.25E-02    4.12E-02
8.60      0.00E+00    0.00E+00    0.00E+00    0.00E+00

*  Before imposing displacements
**  Imposing displacements on entire rim
***  Imposing displacements at 3 positions of 0, 45 and 90 degrees on the rim

Table 5-1 FEM deflection of respective meridian curves before and after imposing
displacements (in inches).

Deviation of FEM results from parabolic curve

The deflection of the meridian curve of the model without imposed displacements was
obtained after completing the analysis of Step 3 discussed in Section 3. Once an FEM meridian
curve (hereafter referred to as FEM curve), w, has been found under a certain pressure, p, the
intersection points between the curve and the coordinate axes, i.e., the maximum deflection,
w_max, at the center of the balloon surface and the radius, a, of the surface, can be used to define
the corresponding parabolic curve (hereafter referred to as parabola), w_p, according to the equation

    $w_p = w_{max} \left( 1 - \dfrac{r^2}{a^2} \right)$    (7)

A comparison of an FEM curve under a pressure of 0.02 psi with its corresponding
parabola shows that the FEM curve has larger deflection than the parabola, as shown in Figure
5-1. The difference between these curves, w - w_p, is a “W” shaped curve, as also shown in the
same figure. Only half of the “W” curve is shown because of symmetry, and the curve
has been magnified by 10 times for convenience.


[Figure 5-1: plot titled “Comparison of parabola with FEM results with no imposed displacement”;
horizontal axis: Radius (inches).]

Figure 5-1 A comparison between the FEM meridian curve and its corresponding parabola. The
deviation of the FEM curve from the parabola is a “W” curve, which is magnified by 10
times in the figure.

In the following text, wherever the word deviation is used, it refers to the
difference between an FEM curve and its corresponding parabola. The deviation and the model
radius will be normalized to give a general understanding of the deflection
tendency with and without rim control.
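
The normalized deviation can be checked directly against the Table 5-1 data. The sketch below is an illustration, not the authors' post-processing: it assumes the parabola passes through the apex and the rim, w_p = w_max (1 - r^2 / a^2), and normalizes the deviation by the maximum deflection. The function and variable names are hypothetical.

```python
# Radius (in) and FEM deflections (in) copied from Table 5-1.
radius = [0, 0.75, 1.50, 2.25, 3.00, 3.75, 4.50, 5.25, 6.00, 6.75, 7.50, 8.05, 8.60]
w_free = [0.4733, 0.47, 0.4603, 0.4439, 0.4208, 0.3908, 0.3535, 0.3087,
          0.2558, 0.1944, 0.1236, 6.49e-2, 0.0]    # before imposing displacements
w_rim = [0.1521, 0.151, 0.1475, 0.1417, 0.1337, 0.1233, 0.1106, 9.56e-2,
         7.82e-2, 5.85e-2, 3.65e-2, 1.89e-2, 0.0]  # displacement on entire rim


def normalized_deviation(r, w):
    """(w - w_p) / w_max at each radius, with the parabola fitted through apex and rim."""
    w_max, a = w[0], r[-1]
    return [(wi - w_max * (1.0 - (ri / a) ** 2)) / w_max for ri, wi in zip(r, w)]


dev_free = normalized_deviation(radius, w_free)
dev_rim = normalized_deviation(radius, w_rim)
worst_index = max(range(len(radius)), key=lambda i: dev_free[i])
print("max deviation, no rim control:", max(dev_free), "at r =", radius[worst_index])
print("max deviation, uniform rim displacement:", max(abs(d) for d in dev_rim))
```

Under these assumptions the largest deviation without rim control comes out near 2.7% of the maximum deflection, and the uniform rim displacement brings it down to roughly 0.1%, in line with the values quoted for the FEM results.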

Reduction  of  the  deviation  through  rim  control 

The  deviation  of  a  meridian  curve  from  its  corresponding  parabola  can  be  reduced 
through  rim  control.  Specifically,  0.05  inch  outward  radial  displacements  were  imposed  on 
certain  positions  along  the  rim  of  the  model.  Two  cases  of  imposed  positions  on  the  rim  were 
investigated: 

1)  at  0, 45  and  90  degrees;  and 

2)  a  uniform  displacement  imposed  on  the  entire  rim. 

The  following  symbols  will  be  used  in  some  figures  (noting  that  “n”  denotes 
“normalized”,  “p”  denotes  “parabola”,  “0”  denotes  “uniform  rim  displacement”  and  “3”  denotes 
“under  the  case  of  3  positions  on  the  rim”): 

w_n, normalized deflection before imposing a displacement;
w_np, normalized parabola corresponding to w_n;
w_0n, normalized deflection after a displacement was imposed on the entire rim;
w_0np, normalized parabola corresponding to w_0n;
w_3n, normalized deflection of a meridian curve along which one of the 3 displacements was
imposed on the rim;
w_3np, normalized parabola corresponding to w_3n;
w_3(ln225)n, normalized deflection of ln225 after 3 displacements were imposed on the rim;
w_3(ln225)np, normalized parabola corresponding to w_3(ln225)n.

Imposed displacements at 0, 45, and 90 degrees

After 0.05-inch displacements were imposed at the positions of 0, 45, and 90 degrees, the
maximum deflection of the model was reduced to 0.4095 inch, as shown in Figure 5-2, 13% less
than that before the boundary displacements were imposed. In the same figure, boundary
conditions including the imposed displacement directions are also indicated. Deflections along
the lines horiln, ln225, and midln are plotted in Figure 5-3. For comparison, the FEM meridian
curve before the boundary displacements were imposed, denoted as midln0, is also included in
the figure.

Figure 5-2 The maximum deflection was reduced by 13% after an imposed displacement of 0.05
inch applied at 3 positions on the rim.

Figure 5-3 FEM meridian curves at 0, 22.5, and 45 degrees.


It was found that there was a bulge in midln and a concavity in horiln, both close to
the rim. To give a full picture of the shape of the model in this state, profiles of
circumferential curves with radii of 1.50, 3.00, 4.50, 6.00, 7.50 and 8.60 inches, respectively, are
given in Figure 5-4, showing that when the radius was greater than 6 inches, these curves were
no longer of constant z-deflection.

[Figure 5-5: plot titled “Imposed displacements at 0, 45, and 90 degrees”.]

Figure 5-5 Deviations of horiln and ln225 were reduced in the area with a radius 60% of the
model radius.


Displacement  imposed  on  entire  rim. 

After a 0.05 inch outward displacement was imposed on the entire rim, as shown in Figure
5-6, the maximum deflection was 0.1521 inch, a reduction of 68% compared with the deflection
of 0.4722 inch before the displacement. The deviation was also dramatically reduced, as seen
from Figure 5-7. Before boundary displacements, the deviation was 2.7%; the deviation became
0.1% after uniform displacements. The stress distribution was almost uniform, as shown in
Figure 5-8, and increased by 3 times compared with that before boundary displacements.

Figure 5-6 The maximum deflection was greatly reduced after a displacement of 0.05 inch was
uniformly imposed on the rim of the model.

[Figure 5-7: plot titled “Imposed displacements at all directions”.]

Figure 5-7 The maximum deviation from parabola was reduced to 0.1% after a displacement of
0.05 inch was uniformly imposed on the entire rim, compared with 2.7% before boundary
displacements.

Figure 5-8 Stress in the model was almost uniform after an outward radial displacement of 0.05
inch was uniformly imposed on the entire rim.


The experimental results are presented below graphically, and show how the
pressure and boundary displacements affect the balloon profile.

Effect  of  pressure 

In Figure 5-9, the experimentally measured profiles (w_e) of the balloon along a meridian at
two constant pressures are shown, along with the desired parabolic shape (w_p), and the deviation
from the parabola (w_e - w_p). The parabola plotted here is drawn to fit through the apex and
boundary of the balloon profile. Both the vertical displacements and the radial positions are
normalized with respect to their maximum values, to make the effects clearer. Figs. 5-9a and
5-9b are then compared in Fig. 5-9c to show the effect of pressure on the profile of the balloon.
From this third figure it can be seen that at the higher pressure the profile is closer to the desired
shape; however, this may be an artifact of too low a pressure in the first case (Fig. 5-9a).

[Figure 5-9a: plot titled “Comparison of experimental data (p=.01psi) and parabola”.]

Figure 5-9 Experimentally measured profiles, a) p = 0.01 psi; b) p = 0.05 psi; c) comparing a) and b).


Effect  of  boundary  displacements 

Two  experiments  at  different  pressures  were  conducted  to  see  the  effect  of  boundary 
displacement  on  surface  profile,  and  the  results  are  plotted  in  Fig.  5-10.  The  results  show  that  the 
profile of the balloon with the boundary displacement is closer to the parabola than in the
inflation-only case. This is the same trend seen in the FEM model.


[Figure 5-10a: plot titled “Comparison of experimental data (.015 psi) and parabola fit to the
ends and the effect of boundary displacement (.05 in)”; legend: we(delta=0), wp, we-wp,
we(delta=.05), we(0.05)-wp.]

[Figure 5-10b: plot titled “Comparison of experimental data (.02 psi) and parabola fit to the
ends and the effect of boundary displacement (.05)”.]

Figure 5-10. Experimentally measured profiles with the boundary displacement of 0.05 in: a) p =
0.015 psi; b) p = 0.02 psi.

6.  CONCLUSIONS

Inflatable  structures  have  been  used  for  many  years  in  terrestrial  applications,  from  sports 
stadium roofs to automotive airbags. Inflatables will achieve their greatest potential, however, in
space-based applications. Along with their high strength-to-weight ratio, the capability of
reduced launch mass and volume makes inflatables highly desirable candidates for space-based
structures. 

This  investigation  has  focused  on  initial  inquiries  into  the  shape  control  of  inflatable 
structures,  specifically  a  thin  inflated  disk.  It  has  been  shown  that  the  shape  of  an  initially  plane 
circular  disk  is  not  inherently  parabolic.  Radial  displacements  at  the  boundary  can,  however, 
move  the  surface  toward  a  parabolic  shape.  Moreover,  this  effect  is  pronounced  at  locations 
distant  from  the  locations  where  the  discrete  boundary  displacements  are  applied.  The  so-called 
“W-curve”  has  been  shown  to  be  reduced  by  imposed  outward  radial  displacements  at  the 
boundary. 

Abstract

Finite-Difference Time-Domain (FD-TD) algorithms are developed to study the
transformation of an electromagnetic wave by a dynamic (time-varying) inhomogeneous
magnetized plasma medium. A one-dimensional code is used to verify results obtained
earlier by approximate analytical methods based on (a) the WKB method for a slow rise
time of the profile and (b) the Green’s function technique for rapid switching of the
plasma medium. Spectrum analysis using an enhanced Fast Fourier Transform (FFT)
technique is used to obtain the amplitudes and the frequencies of the new modes
generated by the switching of the medium.

A three-dimensional FD-TD code, valid for arbitrary variation in space and time
of (a) plasma density, (b) direction and strength of static magnetic field, and (c) collision
frequency, is developed. The accuracy and the stability of the three-dimensional code are
being examined by studying the mode coupling in a rectangular plasma cavity containing
the dynamic inhomogeneous magnetized plasma. The code will be used to study the
remarkable transformation of an electromagnetic wave and explore its applications.

Numerical Simulation of Electromagnetic Wave Transformation in a Dynamic Magnetized Plasma

Dikshitulu  K.  Kalluri 
Joo  Hwa  Lee 


1  Introduction 

The FD-TD method [1]-[3] has become a very powerful method to analyze
electromagnetic  problems  due  to  its  simplicity  and  accuracy.  It  is  a  natural  representation 
of  Maxwell’s  equations  in  a  discretized  form. 

In this report, we study the effects of a time-varying and space-varying magnetized
plasma medium on an electromagnetic wave. Most time-varying plasma problems do not
have exact solutions and are likely to be described by high-order differential equations.
Hence, only very simple problems have been solved exactly and many problems are left
unsolved. Because of this complexity, approximate solutions have been given in
several forms. WKB methods [4]-[5] are used for slowly varying plasmas and Green’s
function methods [6] are used for fast-profile plasma problems. These solutions have not
been verified in some cases because numerical solutions usually give only the total fields.
Some of the modes in the WKB solution have very weak field amplitudes and are hard to
detect in the total fields. We therefore need a tool that can extract the weak components
(modes) from the output signals of the computer simulations. Green’s function methods
for fast profiles can give very good approximate solutions for small perturbations. When
the perturbations are large, the solution tends to degrade quickly.

In this report we verify the solutions given by the WKB methods and Green’s
function methods by comparing them with numerical results. The FD-TD method is used
to find time series for the total fields. The frequencies and the amplitudes of the various
modes are then obtained by using a modified FFT method.

2  One  Dimensional  FD-TD  Algorithm 


2.1  One  Dimensional  Equations

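Consider a magnetoplasma medium with time-varying and space-varying plasma
frequency in one dimension. Maxwell’s equations and the constitutive relation for
the plasma medium [7] are given by

    $\nabla \times \mathbf{E} = -\mu_0 \dfrac{\partial \mathbf{H}}{\partial t}$,    (1)

    $\nabla \times \mathbf{H} = \varepsilon_0 \dfrac{\partial \mathbf{E}}{\partial t} + \mathbf{J}$,    (2)

    $\dfrac{d\mathbf{J}}{dt} = \varepsilon_0 \omega_p^2(z,t)\,\mathbf{E} + \boldsymbol{\omega}_b \times \mathbf{J}$.    (3)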

The electromagnetic wave is assumed to be propagating in the z-direction and the static
magnetic field is also assumed to be in the z-direction. The natural modes of this case are
circularly polarized waves. For the specific example of a right circularly polarized wave
(R-wave), the fields may be written as:

    $\mathbf{E} = (\hat{x} - j\hat{y})\,E(z,t)$,    (4)

    $\mathbf{H} = (j\hat{x} + \hat{y})\,H(z,t)$,    (5)

    $\mathbf{J} = (\hat{x} - j\hat{y})\,J(z,t)$,    (6)

where $j = \sqrt{-1}$.

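From (1)-(3) and (4)-(6), we will get the following one dimensional equations:

    $\dfrac{\partial H}{\partial t} = -\dfrac{1}{\mu_0} \dfrac{\partial E}{\partial z}$,    (7)

    $\dfrac{\partial E}{\partial t} = -\dfrac{1}{\varepsilon_0} \dfrac{\partial H}{\partial z} - \dfrac{1}{\varepsilon_0} J$,    (8)

    $\dfrac{dJ}{dt} = \varepsilon_0 \omega_p^2(z,t)\,E + j\omega_b J$.    (9)

These equations (7), (8), and (9) are valid for both isotropic ($\omega_b = 0$) and magnetized
plasmas ($\omega_b \neq 0$).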
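
The reduction from the vector equations to the scalar ones can be checked term by term. The short derivation below is a sketch that assumes the fields depend only on z and t and that the static magnetic field enters through $\boldsymbol{\omega}_b = \omega_b \hat{z}$; the intermediate identities are supplied here for illustration and are not quoted from the report.

```latex
\begin{align*}
\nabla\times\mathbf{E}
  &= \hat{z}\times(\hat{x}-j\hat{y})\,\frac{\partial E}{\partial z}
   = (j\hat{x}+\hat{y})\,\frac{\partial E}{\partial z}
   = -\mu_0\,(j\hat{x}+\hat{y})\,\frac{\partial H}{\partial t}
  &&\Rightarrow\quad
  \frac{\partial H}{\partial t} = -\frac{1}{\mu_0}\frac{\partial E}{\partial z},\\[4pt]
\nabla\times\mathbf{H}
  &= \hat{z}\times(j\hat{x}+\hat{y})\,\frac{\partial H}{\partial z}
   = -(\hat{x}-j\hat{y})\,\frac{\partial H}{\partial z}
   = (\hat{x}-j\hat{y})\left(\varepsilon_0\frac{\partial E}{\partial t}+J\right)
  &&\Rightarrow\quad
  \frac{\partial E}{\partial t} = -\frac{1}{\varepsilon_0}\frac{\partial H}{\partial z}
                                  -\frac{1}{\varepsilon_0}J,\\[4pt]
\boldsymbol{\omega}_b\times\mathbf{J}
  &= \omega_b\,\hat{z}\times(\hat{x}-j\hat{y})\,J
   = j\omega_b\,(\hat{x}-j\hat{y})\,J
  &&\Rightarrow\quad
  \frac{dJ}{dt} = \varepsilon_0\,\omega_p^2(z,t)\,E + j\omega_b J .
\end{align*}
```

Each line uses only $\hat{z}\times\hat{x}=\hat{y}$ and $\hat{z}\times\hat{y}=-\hat{x}$, so the common polarization vector cancels from both sides.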


2.2  Grid  Generation 

We can use several grid configurations to discretize (7)-(9). Let $z = k\Delta z$ and
$t = n\Delta t$. We can use the standard Yee’s grids for E and H [1]. As (9) implies, E and J
lie on the same space coordinate but will be offset by 1/2 time step.


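2.3  FD-TD  Formulation

Using central difference formulas we generate the following approximations for (7)-(9):

    $\dfrac{H|_{k+1/2}^{n+1/2} - H|_{k+1/2}^{n-1/2}}{\Delta t} = -\dfrac{1}{\mu_0}\,\dfrac{E|_{k+1}^{n} - E|_{k}^{n}}{\Delta z}$,    (10)

    $\dfrac{E|_{k}^{n+1} - E|_{k}^{n}}{\Delta t} = -\dfrac{1}{\varepsilon_0}\,\dfrac{H|_{k+1/2}^{n+1/2} - H|_{k-1/2}^{n+1/2}}{\Delta z} - \dfrac{1}{\varepsilon_0}\,J|_{k}^{n+1/2}$,    (11)

    $\dfrac{J|_{k}^{n+1/2} - J|_{k}^{n-1/2}}{\Delta t} = \varepsilon_0\,\omega_p^2|_{k}^{n}\,E|_{k}^{n} + j\omega_b\,\dfrac{J|_{k}^{n+1/2} + J|_{k}^{n-1/2}}{2}$.    (12)

These equations reduce to

    $H|_{k+1/2}^{n+1/2} = H|_{k+1/2}^{n-1/2} - \dfrac{\Delta t}{\mu_0 \Delta z}\left(E|_{k+1}^{n} - E|_{k}^{n}\right)$,    (13)

    $E|_{k}^{n+1} = E|_{k}^{n} - \dfrac{\Delta t}{\varepsilon_0 \Delta z}\left(H|_{k+1/2}^{n+1/2} - H|_{k-1/2}^{n+1/2}\right) - \dfrac{\Delta t}{\varepsilon_0}\,J|_{k}^{n+1/2}$,    (14)

    $J|_{k}^{n+1/2} = \dfrac{1 + j\omega_b \Delta t/2}{1 - j\omega_b \Delta t/2}\,J|_{k}^{n-1/2} + \dfrac{\varepsilon_0\,\omega_p^2|_{k}^{n}\,\Delta t}{1 - j\omega_b \Delta t/2}\,E|_{k}^{n}$.    (15)

Initial conditions are given at n = 1/2 for J and H and n = 0 for E.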


2.4  Spectrum  Analysis  using  an  Enhanced  FFT 

A  spectrum  analysis  is  a  useful  tool  to  extract  information  on  signals.  Usually  a 
Fast  Fourier  Transform  (FFT)  is  used  to  compute  the  scattering  coefficients  in  the 
scattered  fields  when  an  electromagnetic  wave  is  propagating  in  different  media  [8].  A 
pulsed  incident  wave  is  assumed  and  its  power  spectrum  or  amplitude  spectrum  is 
computed.  After  scattering  takes  place,  the  power  spectrum  of  the  scattered  fields  is 
obtained  and  the  ratio  of  the  two  spectra  gives  the  scattering  coefficients  as  a  function  of 
frequency.  This  method  allows  us  to  find  the  frequency  response  over  a  wide  range. 
However, the frequency accuracy is limited by the frequency spacing of the FFT, which is the
inverse of the total record length (the number of samples times the sampling interval). We
may assume that the frequency of a given mode is likely to lie between two adjacent
frequencies determined by the FFT analysis. Therefore, the maximum error in the frequency
estimation is half the frequency spacing. This frequency error may not be serious for the
modes whose amplitudes are large enough. However, small amplitude modes can be buried or
undetectable due to the errors of poor frequency estimation. This will result in
misinterpretation of the results. In order to remedy this problem, we need to estimate
accurately the frequencies in the signals.
Let s be a signal with N samples:

    $s_n = s(t_n), \quad n = 1, 2, \cdots, N$.    (16)

$s_n$ is assumed to be written as

    $s_n = \sum_{m=1}^{M} A_m e^{j 2\pi f_m t_n}$,    (17)

where M is the number of modes, $A_m$ is the amplitude, and $f_m$ is the frequency of a
mode. Let FFT be a fast Fourier transform operator given by:

    $\tilde{s}_k = \mathrm{FFT}(s) = \dfrac{1}{N} \sum_{n=1}^{N} s_n e^{-j 2\pi f_k t_n}$.    (18)

Our object is to find $A_m$ and $f_m$ as accurately as possible. We may think of a least
square method:

    $SSE = \sum_{n=1}^{N} \left| s_n - \sum_{m=1}^{M} A_m e^{j 2\pi f_m t_n} \right|^2$.    (19)

However, this equation is nonlinear in $A_m$ and $f_m$ and therefore is not easy to handle.
The authors have used the following procedure to find these parameters accurately:

Step 1  Let $A_m = 0$ for $m = 1, 2, \cdots, M$.

Step 2  Repeat Steps 3 to 5 for $m = 1, 2, \cdots, M$.

Step 3  $r_n = s_n - \sum_{l=1,\, l \neq m}^{M} A_l e^{j 2\pi f_l t_n}$ for $n = 1, 2, \cdots, N$.

Step 4  Choose the frequency that gives the maximum amplitude by the FFT(r) and call it $f_m$.

Step 5  Accurately find the frequency $f_m$ by maximizing $|A_m|$ given by (20) in
        the neighborhood of the frequency $f_m$ using an optimization
        technique available in MATLAB as fmin.

    $A_m = \dfrac{1}{N} \sum_{n=1}^{N} r_n e^{-j 2\pi f_m t_n}$    (20)

Step 6  Repeat Step 2 to Step 5 if the following condition is not satisfied:

    $\sum_{n=1}^{N} \left| s_n - \sum_{l=1}^{M} A_l e^{j 2\pi f_l t_n} \right|^2 <$ tolerance.    (21)


This procedure eliminates the mutual interference between two adjacent
components and accurately finds frequencies and amplitudes. Figures 2 and 3 are
examples of the signal analysis by the above procedure. Figure 2 shows the real and
imaginary parts of a signal s(t). Figure 3 shows the power spectral density of s(t). The
broken line is obtained by an ordinary FFT method [8]. This reveals two distinct peaks
and two small peaks. The solid line is obtained by the new method and shows four
distinct peaks. For the strong modes there exist small differences in frequencies and
amplitudes between the two methods. For the weakest mode, the difference between the
two methods is huge, about 40 dB, which means the conventional method may not
produce an accurate result for the weak signal. In this figure, the residual signal by the new
method is below -100 dB, which assures the accuracy of the new method. This method
has a slow convergence when two frequencies are very close.
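
The iterative procedure can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' MATLAB code: a plain O(N^2) DFT stands in for the FFT, a golden-section search stands in for MATLAB's fmin, samples are taken at t_n = n*dt starting from n = 0, and the function names and the two-tone test signal are hypothetical.

```python
import cmath
import math


def dft(x):
    """Plain O(N^2) DFT scaled by 1/N (a stand-in for an FFT routine)."""
    n = len(x)
    tw = [cmath.exp(-2j * math.pi * k / n) for k in range(n)]
    return [sum(x[i] * tw[(i * k) % n] for i in range(n)) / n for k in range(n)]


def amplitude(r, f, dt):
    """Projection of r onto exp(j*2*pi*f*t_n), in the spirit of Eq. (20)."""
    n = len(r)
    return sum(r[i] * cmath.exp(-2j * math.pi * f * i * dt) for i in range(n)) / n


def refine(r, f0, df, dt, iters=40):
    """Golden-section search for the f in [f0-df, f0+df] that maximizes |amplitude|."""
    g = (math.sqrt(5.0) - 1.0) / 2.0
    lo, hi = f0 - df, f0 + df
    c, d = hi - g * (hi - lo), lo + g * (hi - lo)
    fc, fd = abs(amplitude(r, c, dt)), abs(amplitude(r, d, dt))
    for _ in range(iters):
        if fc > fd:                      # maximum lies in [lo, d]
            hi, d, fd = d, c, fc
            c = hi - g * (hi - lo)
            fc = abs(amplitude(r, c, dt))
        else:                            # maximum lies in [c, hi]
            lo, c, fc = c, d, fd
            d = lo + g * (hi - lo)
            fd = abs(amplitude(r, d, dt))
    return 0.5 * (lo + hi)


def extract_modes(s, dt, n_modes, tol=1e-10, max_sweeps=8):
    """Estimate frequencies and complex amplitudes of n_modes exponentials in s."""
    n = len(s)
    df = 1.0 / (n * dt)                  # frequency spacing of the DFT
    freqs = [0.0] * n_modes
    amps = [0j] * n_modes                # Step 1: all amplitudes start at zero

    def model(i, skip=None):
        return sum(amps[l] * cmath.exp(2j * math.pi * freqs[l] * i * dt)
                   for l in range(n_modes) if l != skip)

    for _ in range(max_sweeps):
        for m in range(n_modes):                              # Step 2
            r = [s[i] - model(i, skip=m) for i in range(n)]   # Step 3: remove other modes
            spec = dft(r)                                     # Step 4: coarse peak
            k = max(range(n), key=lambda q: abs(spec[q]))
            f0 = k * df if k <= n // 2 else (k - n) * df      # map to signed frequency
            freqs[m] = refine(r, f0, df, dt)                  # Step 5: fine search
            amps[m] = amplitude(r, freqs[m], dt)
        sse = sum(abs(s[i] - model(i)) ** 2 for i in range(n))  # Step 6
        if sse < tol:
            break
    return freqs, amps


# Hypothetical test signal: a strong mode and a weak mode, both off the DFT grid.
N, DT = 128, 1.0
true_f = [0.1037, 0.2712]
true_a = [1.0 + 0j, 0.01 * cmath.exp(0.5j)]
sig = [sum(a * cmath.exp(2j * math.pi * f * i * DT) for f, a in zip(true_f, true_a))
       for i in range(N)]
est_f, est_a = extract_modes(sig, DT, 2)
print(est_f, est_a)
```

Removing the already-estimated modes before each peak search is what lets the weak mode be located even though the spectral leakage of the strong mode at that frequency is comparable to the weak amplitude.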

Note that we need to use a complex signal to distinguish reflected waves from
transmitted waves. However, the problems under discussion generate complex time
series.


3  Unbounded  Isotropic  Plasma 

3.1  Simulation  of  an  Unbounded  Isotropic  Plasma

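Simulation of an unbounded problem by FD-TD requires special treatment since
an infinite space is taken into consideration. For a space-invariant time-varying medium,
the simulation can be simplified by noting that the wavenumber is conserved during a
temporal change of the plasma medium [9]. This implies that the wavelength $\lambda$ of the
waves in the plasma medium remains the same throughout the entire procedure. Hence the
waves have the following property:

    $E(t,z) = E(t,z+\lambda)$,    (22)

    $H(t,z) = H(t,z+\lambda)$,    (23)

    $J(t,z) = J(t,z+\lambda)$.    (24)

Let the spatial domain consist of K cells, k = 0, 1, ..., K-1, with $K\Delta z = \lambda$ the
size of the spatial domain. As a result,

    $E|_{K}^{n} = E|_{0}^{n}$,    (25)

    $H|_{-1/2}^{n+1/2} = H|_{K-1/2}^{n+1/2}$.    (26)

The computation molecules (13) and (14) at both boundaries will be given as

    $H|_{K-1/2}^{n+1/2} = H|_{K-1/2}^{n-1/2} - \dfrac{\Delta t}{\mu_0 \Delta z}\left(E|_{0}^{n} - E|_{K-1}^{n}\right)$,    (27)

    $E|_{0}^{n+1} = E|_{0}^{n} - \dfrac{\Delta t}{\varepsilon_0 \Delta z}\left(H|_{1/2}^{n+1/2} - H|_{K-1/2}^{n+1/2}\right) - \dfrac{\Delta t}{\varepsilon_0}\,J|_{0}^{n+1/2}$.    (28)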

3.2  Sudden  Creation 

It is assumed that the incident wave (source wave) is propagating in free space in
the positive z direction with the frequency $\omega_0$:

    $E(z,t) = E_0 \exp[j(\omega_0 t - k_0 z)]$,    (29)

    $H(z,t) = H_0 \exp[j(\omega_0 t - k_0 z)]$,    (30)

    $J = 0$,    (31)

where $k_0 = \omega_0 / c$. To simplify the computation we may assume $c = \varepsilon_0 = \mu_0 = 1$.
For t < 0, the entire space is considered to be free space and at t = 0, the entire space is
assumed to be converted into a plasma medium with the plasma frequency $\omega_p$. By the
sudden creation of the plasma medium, new waves are generated [7] and their properties are
summarized in Table 1.

Table 1 Summary of frequencies and amplitudes for the sudden switching case

Mode | Initial values | Mode 1 | Mode 2 | Mode 3

In the FD-TD simulation, we also assume that $\omega_0 = 1$ for convenience. The FD-TD
results are shown in Figures 4-7. Both frequencies and amplitudes are well matched.
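
A compact version of such a simulation can be written with the leapfrog updates of Section 2.3 and a periodic domain one wavelength long. The sketch below is illustrative only and rests on several assumptions that are not taken from the report: normalized units c = eps0 = mu0 = omega0 = 1, H and J initialized half a time step before E, a grid of 200 cells with a Courant number of 0.5, and, for the isotropic check, the standard two-mode result for sudden creation, omega_1 = sqrt(omega0^2 + omega_p^2) with relative amplitudes (1 +/- omega0/omega_1)/2.

```python
import cmath
import math


def sudden_creation(omega_p, omega_b=0.0, cells=200, courant=0.5, t_end=10.0):
    """1-D FD-TD for an R-wave in a plasma created at t = 0 (normalized units)."""
    dz = 2.0 * math.pi / cells           # one wavelength (k0 = 1) spans the periodic domain
    dt = courant * dz
    steps = int(round(t_end / dt))
    # Assumed initial conditions: E at n = 0, H and J at n = -1/2 (free-space wave, J = 0).
    E = [cmath.exp(-1j * k * dz) for k in range(cells)]
    H = [cmath.exp(1j * (-0.5 * dt - (k + 0.5) * dz)) for k in range(cells)]  # H[k] at k+1/2
    J = [0j] * cells
    a = (1 + 0.5j * omega_b * dt) / (1 - 0.5j * omega_b * dt)
    b = omega_p ** 2 * dt / (1 - 0.5j * omega_b * dt)
    history = [E[0]]
    for _ in range(steps):
        for k in range(cells):           # H update; E wraps around periodically
            H[k] -= dt / dz * (E[(k + 1) % cells] - E[k])
        for k in range(cells):           # J update (semi-implicit in the omega_b term)
            J[k] = a * J[k] + b * E[k]
        for k in range(cells):           # E update; H[-1] is the periodic neighbour of E[0]
            E[k] -= dt / dz * (H[k] - H[k - 1]) + dt * J[k]
        history.append(E[0])
    return dt, history


# Isotropic example: omega_p^2 = 3 gives omega_1 = sqrt(1 + 3) = 2.
dt, e_hist = sudden_creation(math.sqrt(3.0))
omega_1 = 2.0
a_fwd, a_bwd = 0.5 * (1 + 1 / omega_1), 0.5 * (1 - 1 / omega_1)  # assumed analytic amplitudes
worst = max(abs(e - (a_fwd * cmath.exp(1j * omega_1 * n * dt)
                     + a_bwd * cmath.exp(-1j * omega_1 * n * dt)))
            for n, e in enumerate(e_hist))
print("largest deviation from the assumed two-mode solution:", worst)
```

The periodic wrap in the H and E loops plays the role of the boundary molecules, so no absorbing boundary is needed for this unbounded, space-invariant case. Passing a nonzero `omega_b` exercises the magnetized form of the J update.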


3.3  Slow  Creation 


The  plasma  frequency  is  slowly  varied  from  0  to  a  certain  value.  Generally  an 
exact  solution  to  this  slowly-varying  plasma  creation  problem  does  not  exist.  However, 
the  WKB  method  can  be  used  to  obtain  the  amplitudes  of  the  new  waves.  We  assumed 
an  exponential  plasma  profile  for  the  simulation: 

=  (32) 


where $T_r$ is the rise time. In this profile, a WKB solution for the electric fields is given
by [10]


E,= 


(33) 


The results are shown in Figs. 8 and 9. As expected, the WKB solutions match the
FD-TD solution for large $T_r$. However, we see in Fig. 9 a deviation in the curves for $T_r$
larger than 70. This is because the FD-TD simulation has been obtained only to a certain
accuracy. When this accuracy is improved, they match well.


3.4  Rapid  Creation  -  Linear  Profile 

When the switching action is fast, the WKB method may not be suitable. Also,
exact solutions for the fast switching profiles do not exist in general. A different
approach has been used to handle this kind of problem [6]. A Green’s function method
gives an approximate solution by using a perturbation technique. The difference function
between the actual profile and a reference profile is used to generate a perturbation series.
The reference profile must have an exact solution. By its nature, this method has two
fundamental limitations. One is that the second-order approximation can become very
complicated. The other is that it is hard to determine its range of validity. By using the
FD-TD method, we can easily verify the accuracy and validity range of such solutions.

A linear profile as shown in Fig. 10 is an example of a problem having an exact
solution that can be used to test the validity of the three methods, i.e., WKB, Green’s
function method, and FD-TD. Figures 11 and 12 show such comparisons. The FD-TD
and theoretical results agree over a wide range of rise times of the plasma profile. Green’s
function solutions are given in [6], and (33) and (34) are used for the WKB method. It is
seen that for the fast profile ($T_r$ is small) the Green’s function method is valid while for
the slow profile ($T_r$ is large) the WKB method is valid.

3.5  Rapid  Creation  -  Hump  Profile 

A hump profile has been used to show the applicability of the Green’s function
method [6] because it can handle complicated profiles. The FD-TD solutions are
compared with the Green’s function method in Figs. 13 and 14. Figure 13a shows a
sketch of the hump profile.

3.6  Periodic  Plasma  Slab 

Interaction  of  an  electromagnetic  wave  with  a  rapidly  created  spatially  periodic 
plasma  was  studied  by  S.  P.  Kuo  and  James  Faith  [11].  It  can  be  extended  to  the 
magnetized  plasma  slab. 
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In  a  periodic  isotropic  plasma  slab,  the  wave  can  be  represented  as: 


E{z)  = 


Aexp[-  jkz]+  Bexp\+  jkz\ 


-I  <  z<l 


C  exp[-  jkr\(z  -l)]+D  exp[+  jkr\(z  -/)]  I  <z<l  +  d' 


where  rj  is  the  index  of  refraction,  L  is  the  separation  between  two  adjacent  slabs,  d  is  the 
width  of  the  slab,  and  l  =  {L-d)/2.  Due  to  the  periodicity,  the  wave  has  to  satisfy  the 
Bloch  wave  condition  [11]-[12]: 

E(z)  =  e^’^^E(z  +  L)  (36) 

where  P  is  the  propagation  constant. 

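Applying the boundary conditions at $z = l$ and $z = -l$ together with (36), the dispersion
relation can be obtained as [11]:

$$\cos\beta L = \cos k\eta d \,\cos 2kl - \frac{1}{2}\left(\eta + \frac{1}{\eta}\right)\sin k\eta d \,\sin 2kl, \qquad (37)$$

where

$$\beta L = k_0 L + 2m\pi, \qquad \frac{\beta}{k_0} = 1 + \frac{2m\pi}{k_0 L}.$$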
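
The dispersion relation (37) is easy to evaluate numerically. The following is only a minimal sketch, not the authors' code: the function names `bloch_rhs` and `is_passband` and the sample numbers are illustrative assumptions. It evaluates the right-hand side of (37) and tests whether a real propagation constant exists. A useful sanity check is that for $\eta = 1$ (no plasma) the right-hand side must reduce to $\cos kL$ with $L = d + 2l$.

```python
import cmath
import math


def bloch_rhs(k, eta, d, l):
    """Right-hand side of the dispersion relation (37), i.e. cos(beta * L).

    k   : wave number in the free-space region
    eta : index of refraction of the slab (may be complex)
    d   : slab width
    l   : half-width of the free-space gap, l = (L - d) / 2
    """
    a = k * eta * d      # phase accumulated across the plasma slab
    b = 2.0 * k * l      # phase accumulated across the free-space gap
    return (cmath.cos(a) * cmath.cos(b)
            - 0.5 * (eta + 1.0 / eta) * cmath.sin(a) * cmath.sin(b))


def is_passband(k, eta, d, l, tol=1e-12):
    """True if (37) admits a real beta, i.e. the right-hand side is real and |rhs| <= 1."""
    rhs = bloch_rhs(k, eta, d, l)
    return abs(rhs.imag) < tol and abs(rhs.real) <= 1.0


# Illustrative (hypothetical) numbers: with eta = 1 the slab is invisible,
# so the result must equal cos(k * L) with L = d + 2 * l = 1.0.
k, d, l = 2.0, 0.3, 0.35
check = bloch_rhs(k, 1.0, d, l)
```

Scanning `is_passband` over a range of `k` for a fixed `eta` would map out the pass and stop bands of the periodic structure; for a dispersive plasma, `eta` itself would have to be recomputed at each frequency.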


Based on this relation, we can construct the electric fields for $t > 0$ as


m=+ov 

Z,t)= 


It  is  noted  from  (36)  that  for  any  P  we  have  the  following  condition: 


Hence we can simulate the problem easily with a single cell, consisting of a free-space
region and a plasma slab, instead of an infinite number of cells. In the Yee cell, we may
write the FD-TD equations as
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(41) 


lA'+l 


-JkoL 


ff  ^  ij  jk^L 

0  K  ^ 


(42) 


These relations as well as (37) hold for magnetized plasma media where $\eta$ is given by


0) 


pO 


co(a)-coJ 


(43) 


In  free  space,  =  co„|/c .  Therefore,  (39)  can  be  written  as 


E{z,t) 


/n=-h>o 

ffI=-oo 


In  plasma,  (o^  =  Tj^^c  and  (39)  becomes 


(44) 


/n=+«  P  - 

£(z,0= 


(45) 


Work is in progress to extend the results obtained by Kuo [11] to the case of a switched,
spatially periodic magnetized plasma medium, with particular emphasis on the effect of
the collision frequency and the mode amplitudes.


4  Plasma  Slab  Problem 

4.1 PML

The effect of spatial boundaries may be studied by considering the slab problem.
It is a one-dimensional problem; however, we need to define an appropriate outer
boundary for truncating the lattice [3] and thus limit the size of the field computation
domain. At the boundaries of the lattice a suitable boundary condition must be used to
simulate its extension to infinity. The waves propagating outward must be terminated at
the boundaries without any reflections. Such a boundary condition is called an absorbing
boundary condition (ABC). The most recent advance in material-based ABCs was put
forward by Berenger [2]. His ABC, termed the perfectly matched layer (PML) absorbing
boundary condition, appears to yield a major improvement in the reduction of boundary
reflections compared to any ABC proposed previously.

4.2  FD-TD  Simulation  of  Switched  Plasma  Slab 

Using the FD-TD algorithm and the PML condition, we simulated the frequency-shifting
property of a plasma slab, and the result is in agreement with our analytical results
obtained earlier for the case of sudden switching. Work is in progress for the case of an
arbitrary profile of the plasma density in the slab.


5  Three  Dimensional  Resonator 

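5.1 Three Dimensional FD-TD for a Magnetoplasma with Collisions

The governing equations are given [7] by (1), (2), and (46),

$$\frac{\partial \mathbf{J}(\mathbf{r},t)}{\partial t} + \nu(\mathbf{r},t)\,\mathbf{J}(\mathbf{r},t) = \varepsilon_0\,\omega_p^2(\mathbf{r},t)\,\mathbf{E}(\mathbf{r},t) + \boldsymbol{\omega}_b(\mathbf{r},t)\times\mathbf{J}(\mathbf{r},t) \qquad (46)$$

where $\boldsymbol{\omega}_b$ is the gyrofrequency, written in Cartesian components as

$$\boldsymbol{\omega}_b = \omega_{bx}\hat{\mathbf{x}} + \omega_{by}\hat{\mathbf{y}} + \omega_{bz}\hat{\mathbf{z}} \qquad (47)$$

In (46), $\nu$ is the collision frequency. The FD-TD equations by Yee's formula are given
by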
In Equations (54)-(56), the updated components of $\mathbf{J}$ cannot be expressed in terms of the
values at previous time steps alone; instead they form a set of simultaneous equations,
which we would like to avoid. We can avoid this by introducing a matrix equation. After
simplifying and combining (54)-(56), we can construct a matrix equation as


In (57), $\nu$, $\omega_{bx}$, $\omega_{by}$, $\omega_{bz}$, and $\omega_p$ are the values at the grid point considered. The final
FD-TD equations for E and H follow:


5.2 Three Dimensional FD-TD Grid

To implement the FD-TD algorithm given by (57)-(63) in a rectangular resonator,
we need to extend Fig. 1 to three-dimensional space. J can be placed in the center of
the cubic unit cell of the Yee space lattice as shown in Fig. 15. Figure 16 shows two
planes for visualizing the implementation of the boundary conditions. Assigning zero
values to the field components at the boundaries automatically satisfies the boundary
conditions on the walls of the rectangular resonator bounded by perfect conductors.

5.3 Numerical Stability


One important aspect of the FD-TD algorithm is the optimum choice of the time step.
Too large a time step will result in numerical instability. Figure 17 shows the maximum
time step as a function of the final plasma frequency.
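
As a point of reference for the time-step discussion, the free-space stability bound of the standard Yee scheme (the Courant limit) can be computed directly. This is only a sketch under stated assumptions: the function name `courant_limit` and the 1 mm cell size are illustrative, and the bound below applies to free space only. It does not reproduce the plasma-frequency dependence shown in Fig. 17.

```python
import math

C0 = 299_792_458.0  # speed of light in vacuum [m/s]


def courant_limit(dx, dy=None, dz=None, c=C0):
    """Largest stable time step of the standard Yee scheme in free space.

    Pass one, two or three cell sizes for a 1-D, 2-D or 3-D grid:
        dt <= 1 / (c * sqrt(1/dx^2 + 1/dy^2 + 1/dz^2))
    """
    inv_sq = sum(1.0 / (h * h) for h in (dx, dy, dz) if h is not None)
    return 1.0 / (c * math.sqrt(inv_sq))


cell = 1.0e-3                              # assumed 1 mm cell, for illustration only
dt_1d = courant_limit(cell)                # 1-D free-space limit, cell / c
dt_3d = courant_limit(cell, cell, cell)    # cubic 3-D cell: cell / (c * sqrt(3))
```

In one dimension the limit equals the cell size divided by c, which is the free-space 'magic' time step used for normalization in Fig. 17; for a cubic three-dimensional cell the limit is smaller by a factor of the square root of 3.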

5.4  Magnetized  Plasma  in  the  Cavity 

We  are  presently  simulating  the  mode  coupling  when  the  plasma,  in  addition,  is 
magnetized.  A  complete  report  will  be  published  soon. 

6  Conclusion 

We have seen that the FD-TD method can readily simulate a variety of problems and
give very accurate solutions. FD-TD solutions have usually been analyzed with the
FFT method, which has limited resolution. A new analysis tool is presented that
overcomes this conventional limitation. It resolves several modes even if some of them
have low amplitudes.
Figure 1. Position of E, H, and J for one dimensional problem.
Figure 2. Complex signal in time.
Figure 3. Power Spectral Density.
Figure 7. Current densities vs. plasma frequency for the sudden switching case.
Figure 10. Plasma frequency for a linear profile.
Figure 16. Construction of planes for the automatic implementation of the boundary conditions.
Figure 17. Maximum time step vs. the plasma frequency. The optimum value of the time step
is normalized with respect to the 'magic' time step in free space. See [3].
Abstract

Molten  material  is  removed  in  many  laser  applications  such  as  laser  grooving  and  laser 
cutting.  An  assist  gas  is  usually  used  to  remove  the  molten  material  from  the  laser  material 
interaction  zone.  The  effect  of  assist  gas  pressure  on  the  material  removal  rate  is 
investigated  in  this  paper  for  laser  cutting  and  grooving  applications.  The  model  for  melt 
depth  is  based  on  the  overall  energy  balance,  and  the  cut  depth  is  obtained  by  considering 
the  effect  of  the  assist  gas.  The  model  for  kerf  width  is  based  on  the  modified  Rosenthal 
solution  taking  into  account  the  melting  effect.  The  cut  depths  reach  a  constant  value 
beyond  a  critical  pressure  if  the  kerf  width  is  of  the  order  of  nozzle  width  and  through  cuts 
are  assumed.  Most  of  the  molten  material  is  removed  by  the  assist  gas  at  pressures  below 
this  critical  pressure.  The  model  predicts  on  the  basis  of  the  Prandtl  or  Meyer  relation  that 
the  cutting  speed  decreases  when  the  assist  gas  pressure  exceeds  a  critical  value  if  the  kerf 
dimensions  are  smaller  than  the  nozzle  dimensions. 
IMPROVED CHEMICAL OXYGEN-IODINE LASER (COIL) CUTTING MODELS TO
OPTIMIZE LASER PARAMETERS

Khizar Farooq and Aravinda Kar

Introduction 

Melt removal during laser materials processing is modeled in this study, and the model
results are compared to experimental data for a Chemical Oxygen-Iodine Laser (COIL) of
wavelength 1.315 μm. The basic principle of COIL has been discussed briefly by Kar et
al. [1]. Two important applications of melt removal are laser cutting and laser grooving. The
effects of assist gas pressure on laser cutting have been studied by Chryssolouris and
Choi [2], Biermann et al. [3] and Molian and Baldwin [4]. Schuocker and Abel [5] presented a
mathematical model for the material removal process during laser cutting. The physical
mechanism of the process has been discussed by Schuocker [6]. Mathematical models for
evaporative cutting have been developed by Modest and Abakian [7], Bang and Modest [8] and
Roy and Modest [9]. Belic and Stanic [10] and Belic [11] analyzed the laser cutting process with a
simple model involving empirical parameters. The purpose of this paper is to model the
effects of assist gas pressure on the cut depth and kerf width. This work is based on the
model in Ref. [12], in which several physical effects such as convection and shock wave
phenomena were neglected. These effects are analyzed in this study to show that the
cutting speed decreases when the assist gas pressure is increased above a critical value,
leading to shock formation above the substrate surface.

Mathematical  model  for  melt  and  cut  depths 
A.  Energy  and  mass  balance  equations 

The  proposed  model  is  based  on  the  lumped  parameter  technique'^  in  which  the  overall 
energy  and  mass  balances  are  considered  instead  of  the  pointwise  distribution.  The  overall 
energy  and  mass  balance  equations  are,  respectively,  written  as 
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(1) 


APx  =  pw,(d^  -  dj(v  +  VJx[c/T„  - T,)  +  L„+  c/T,  -Tj]  +  p V.vtpi, 
■^fct9^’^A,'\cp(T„  -  To)  +  Z„  +  c/r,  -  T„j\  + 

pw^vi^t  =  pw^(v  +  F„  )[d„  -  d^x  +  fa9^,vd„x  (2) 

The mass of the metal vapor is not included in the mass balance Eq. (2) because it is very
small compared to the mass of the melt. The first, second and third terms on the right
side of Eq. (1) represent the total amount of material processed (melted and boiled). The
first term accounts for the energy associated with the material that leaves the depressed
region at the base of the kerf, as shown in Fig. 1, as a result of the impinging jet,
whereas the second term accounts for the energy required to boil a fraction of the molten
material. The third term accounts for the energy loss as a result of the splashing or
entrainment of molten droplets by the assist gas, and the sticking of molten layers to the
side  walls  of  the  kerf.  denotes  a  fraction  of  the  total  amount  of  material  melted  by  the 
laser beam. $E_{cond}$ is the energy loss from the melt to the surrounding solid region in the
workpiece due to heat conduction. $E_{conv}$ is the convective heat loss from the melt to the
assist gas at the free surface of the liquid metal. $P$ is the power of the incident laser beam and
$A$ is the absorptivity of the substrate. $T_m$, $T_c$ and $T_0$ are respectively the melting
temperature, the characteristic temperature of the melt pool and the ambient temperature. $d_m$ and $d_c$
represent the melt and cut depths respectively. $L_m$, $L_b$, $c_p$ and $V_m$ are respectively the latent
heat of melting, the latent heat of boiling, the specific heat capacity of the substrate at constant
pressure and the average velocity of the liquid metal in the kerf. The laser-substrate interaction
time is defined as $\tau = l/v$, where $l$ is the length of the rectangular laser spot and $v$ is the
scanning velocity. $l$ is measured in the scanning direction.

The energy loss due to heat conduction is determined by using the Fourier law
of heat conduction and is given by the following expression [1]
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(3) 


F  =  d 

^cond  “m 


wjc  +  2lk 

2(ax)''' 


where  k  is  the  thermal  conductivity  and  a  is  the  thermal  diffusivity  of  the  workpiece. 


Laser  beam 


FIG.  1.  High  speed  assist  gas  is  carving  out  a  kerf  in  molten  metal. 


Heat  loss  due  to  convection  as  a  result  of  the  assist  gas  flowing  over  the  melt 
surface  is  given  by 

E..„.  =  hw^JT,-TJx  (4) 

where  is  the  surface  temperature  of  the  melt  and  is  the  temperature  of  the  assist  gas  at 
the  nozzle  exit,  which  will  be  determined  later.  is  an  unknown,  however,  it  is  taken  to 
be  the  characteristic  temperature  of  the  melt,  T^,  in  the  subsequent  calculations.  This 
overestimates  the  heat  loss  due  to  convection  but  the  convection  heat  loss  is  still  found  to  be 

insignificant  as  shown  later.  The  average  heat  transfer  coefficient,  h  is  estimated  by  using 
the  following  expression  that  is  applicable  to  a  single  slot  nozzle'^ 
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3.06Pr^  ^^  Re” 
h  =  -  ^ 
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+  3.06 


(5b) 


Pr,  itg  are  respectively  the  Prandtl  number  and  thermal  conductivity  of  the  gas.  y  is  the 
perpendicular  distance  from  the  line  running  lengthwise  through  the  center  of  the  nozzle.  W 
is  the  width  of  the  nozzle  and  H  is  the  height  of  the  nozzle  from  the  workpiece.  The 

Reynolds  number,  Re  is  given  by  Re=  Vp^jv^,  where  V'  and  are  respectively  the 

velocity  and  kinematic  viscosity  of  the  gas  at  the  nozzle  exit.  The  hydraulic  diameter,  is 
given  by  =  AAJP^,  where  and  P^  are  the  cross-sectional  area  and  perimeter  of  the 

nozzle  respectively,  k^,  and  Pr  are  evaluated  at  one  atmospheric  pressure  and 

temperature  T^. 

B.  Gas  dynamic  analysis  for  the  assist  gas 

The  assist  gas  expands  after  leaving  the  nozzle  depending  on  the  ambient  pressure,  and  its 
velocity,  is  given  by 


M, 


for  j  =  u  or  J  as  discussed  below. 


(6) 


where $\gamma$ is the specific heat ratio, which is also known as the isentropic index. It is given
by $\gamma = c'_p / c'_v$, where $c'_p$ and $c'_v$ are the specific heat capacities of the assist gas at
constant pressure and constant volume respectively. For diatomic gases such as nitrogen, $\gamma = 1.4$.
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R  is  the  universal  gas  constant  and  7^  is  the  room  temperature  which  is  taken  as  300  K  in 
this study. $M^*$ is a dimensionless velocity of the assist gas that depends on its flow
conditions. The critical pressure, $P^*$, is defined as

$$P^* = P_a \left(\frac{\gamma + 1}{2}\right)^{\gamma/(\gamma - 1)}$$

where $P_a$ is the ambient pressure. Depending on the assist gas pressure at the nozzle mouth, $P_n$, three
kinds of flow situations of interest are listed below.


(i) Supersonic flow ($P_n > P^*$) in which the shock wave is formed between the nozzle and the
substrate surface. This may occur in grooving, or in through cuts of small kerf width relative
to the nozzle dimensions. The dimensionless velocity upstream of the shock wave is
given by


(y-^)  UJ 


(7) 


The dimensionless velocity downstream of the shock wave, $M_d^*$, is given by the
Prandtl or Meyer relation [16], $M_d^* = 1/M_u^*$. It should be noted that the dimensionless
velocity upstream of the shock front is taken equal to the dimensionless velocity at the
nozzle exit. The velocity with which the assist gas enters the kerf is obtained from Eq. (6)
by setting $M_j^* = M_d^*$. The dimensionless velocity $M_j^*$ is related to the Mach number $M_j$
through the following relation [16].


1- 


Y-7 
y  +  1 


(8) 


where $j = u$ or $d$ signifies the upstream and downstream conditions respectively. The
temperatures $T_u$ and $T_d$ of the assist gas upstream and downstream of the shock front,
respectively, are given by the following expressions [16]
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7  =  M  or  d 


(9) 


y-1 

2 


(ii)  Supersonic  flow  (P„  >  P*)  in  which  the  shock  wave  is  formed  at  the  kerf  inlet;  The 
velocity  at  the  nozzle  exit  is  obtained  from  Eq.  (6)  by  setting  M‘  =  M‘.  The  flow  will 

be sonic inside the kerf even though the flow is supersonic outside the kerf. This situation
is  encountered  for  through  cuts  of  uniform  width  comparable  to  the  nozzle  width.  This  is 
the  maximum  gas  velocity  that  can  be  attained  in  the  kerf  for  material  removal  to  achieve  the 
highest  cutting  rate. 


(iii)  Subsonic  flow  (P„  <  P‘) ;  The  velocity  F/  at  the  nozzle  exit  is  obtained  from  Eq.  (6) 

by  setting  M-  =  M‘.  In  through  cuts,  the  velocity  of  the  gas  inside  the  kerf  can  be 
calculated  by  treating  the  kerf  as  a  convergent  or  divergent  nozzle  if  the  kerf  tapers  in  or  out 
respectively. 
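
The three flow situations above can be illustrated with the standard relations for isentropic flow of a perfect gas. The sketch below is hedged: the function names are hypothetical, and the expression used for the upstream dimensionless velocity is the textbook isentropic-expansion form, assumed here to play the role of Eq. (7) because that equation is not legible in the source. The critical pressure and the Prandtl or Meyer relation follow the text directly.

```python
import math


def critical_pressure(p_ambient, gamma=1.4):
    """Critical nozzle pressure P* above which the expanded flow is supersonic."""
    return p_ambient * ((gamma + 1.0) / 2.0) ** (gamma / (gamma - 1.0))


def m_star_upstream(p_nozzle, p_ambient, gamma=1.4):
    """Dimensionless velocity M* after isentropic expansion from p_nozzle to p_ambient.

    Standard textbook relation (assumed form); requires p_nozzle >= p_ambient.
    """
    ratio = (p_ambient / p_nozzle) ** ((gamma - 1.0) / gamma)
    return math.sqrt((gamma + 1.0) / (gamma - 1.0) * (1.0 - ratio))


def m_star_downstream(m_star_up):
    """Prandtl or Meyer relation across a normal shock: M_d* = 1 / M_u*."""
    return 1.0 / m_star_up


def static_temperature(t_stagnation, mach, gamma=1.4):
    """Static gas temperature at a given Mach number for stagnation temperature t_stagnation."""
    return t_stagnation / (1.0 + 0.5 * (gamma - 1.0) * mach ** 2)


P_AMBIENT = 1.013e5                       # ambient pressure from Table II [Pa]
p_star = critical_pressure(P_AMBIENT)     # roughly 0.19 MPa for gamma = 1.4
m_up = m_star_upstream(0.4e6, P_AMBIENT)  # illustrative nozzle pressure of 0.4 MPa
m_down = m_star_downstream(m_up)          # below 1: slower gas enters the kerf
t_sonic = static_temperature(300.0, 1.0)  # gas temperature at Mach 1 for 300 K
```

With nitrogen and the ambient pressure of Table II, the critical pressure comes out near 0.19 MPa, in line with the value of about 0.2 MPa quoted in the results, and the sonic gas temperature is 250 K, the value used later for the heat transfer coefficient. For any nozzle pressure above the critical value the downstream dimensionless velocity falls below one, which is the mechanism behind the predicted drop in cutting speed.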

C.  Melt  flow 


The assist gas removes energy from the laser-material interaction zone through two
mechanisms: (i) a surface effect, which involves convective cooling at the free surface of
the melt, and (ii) a bulk effect, which refers to the removal of the energy contained in the melt
that is flushed away by the assist gas. The average velocity of the molten metal is
calculated by considering the Poiseuille flow of the melt [17].


(10) 


The viscosity of the molten metal, $\mu$, depends on temperature as $\mu = \mu_0 \exp(E/RT)$,
where $\mu_0$ and $E$ are constants and their values can be found in Ref. [18]. The pressure
gradient $dp/dx$ in the metal is estimated by taking it to be the pressure gradient of the assist
gas as it flows over the melt surface. Corner flow theory is used to analyze the assist gas
velocity. A stagnation point exists below the gas jet, and the gas accelerates as it moves
away  from  this  point.  The  velocity  of  an  ideal  fluid  flowing  along  a  90°  corner  of 

stationary  solid  surface  is  given  by’’’  =  V^ljH  at  a  distance  I  from  the  stagnation 

point.  In  the  present  case  of  melt  removal,  the  assist  gas  transfers  momentum  to  the  liquid 
metal  to  induce  flow  in  the  melt  giving  rise  to  a  moving  boundary  for  the  gas  flow  instead 
of the stationary surface used in the corner flow theory. This will significantly reduce the
assist  gas  velocity  compared  to  the  value  of  given  by  the  above  expression.  To  account 

for  this  effect,  is  taken  to  be  =  FV^ljH  where  F  is  a  fraction  (0<F<1).  The  pressure 
gradient  is  approximated  as  follows  by  using  the  Bernoulli  equation, 

dp  _  Ap  _  Po^c  _  Po^  g 
dx"  1  21  ] 

where  is  taken  as  the  density  of  the  assist  gas  at  one  atmospheric  pressure  and 
temperature  300  K. 
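
The melt-flow ingredients described above can be sketched numerically. This is an illustration under explicit assumptions, not the authors' implementation: the viscosity is taken to follow an Arrhenius law with the pre-exponential factor and activation energy listed in Table I, and the mean velocity uses the plane Poiseuille result for flow between two walls, assumed here as the form of Eq. (10) because that equation is not legible in the source. The function names are hypothetical.

```python
import math

R_GAS = 8.314  # universal gas constant [J/(mol K)]


def melt_viscosity(temp_k, mu0=0.3699e-3, activation_energy=41.4e3):
    """Arrhenius viscosity of the melt [Pa s] (assumed form: mu0 * exp(E / (R T)))."""
    return mu0 * math.exp(activation_energy / (R_GAS * temp_k))


def mean_melt_velocity(width, dp_dx, mu):
    """Mean velocity of plane Poiseuille flow between walls `width` apart.

    Uses the magnitude of the pressure gradient dp_dx [Pa/m]; assumed form of Eq. (10).
    """
    return width ** 2 * abs(dp_dx) / (12.0 * mu)


mu_melting = melt_viscosity(1809.0)  # viscosity at the melting point of iron
```

At the melting point of iron this gives a viscosity of roughly 5.8 mPa s. The factor of 12 times the viscosity in the Poiseuille expression is the same factor that appears in the denominators of the final cut depth and kerf width expressions.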

The  characteristic  temperature  of  the  melt,  in  Eqs.  (1)  and  (4)  is  obtained  by 
considering  the  stefan  condition  at  the  solid-liquid  interface  which  yields' 

T  -T„  =  e^^^vwi^+e-^ — 

“  “  2kl  2(a/w,) 

$\epsilon$ is the average thickness of the recast layer in the kerf and is taken to be 0.1 mm in this
study. Combining Eqs. (1) and (2) and utilizing Eqs. (3), (4) and (12), the melt depth is
found to be

= _  A  _ ^ _ =_ 

p  vw^  +  +  A3(v\vJ‘^^  +  A/hvW;^)  +  Ajhv'^^w^'' )  +  AJ^w,^ 

(13) 

where  the  coefficients  are  defined  as 
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(14a) 


^0 

2agkl 


9^Cp(T„-Tg) 

2a„(alw,r^ 


Wj^k  +  2kl 

2a,(alw,f^ 


{T^-T,) 


A 

2agkl 

4  - 

^  2ao(alwi^y^^ 

4  ■  (T,„  -TJ 


(14b) 

(14c) 

(Md) 

(14e) 

(14f) 

(14g) 


and 

‘'„=p{c^{T,-T„)+L„  +  ^Li4  (14h) 

For  a  rectangular  laser  spot  of  length  I  =  ]. 7  mm  and  width  w  =  1.2  mm,  the 
thermophysical  data  listed  in  Tables  I  and  II  and  the  assumptions  that  =  300  K,  £  =  0.1 

mm,  P  =  0.1  and  w^.  =  w,  the  coefficients  in  expressions  (14b-h)  are  found  to  be  = 

1.24x10’'’  [J/m^],  A/  =  233.35  [s/m'j,  =  3.29  [s‘'Vm],  A,  =0.0033  [m/s’'^].  A,  =  6.50 

X  lO”  [m.s.K/J],  A^  =  9.18  x  10’  [mlK.s''VJ]  and  A^  =  1.26  x  10’  [m’.K/J].  To 
carrying  out  the  order  of  magnitude  analysis,  these  coefficients  are  multiplied  by  the 
respective  terms  to  obtain  A/ vw^)’  =  1.34  x  10’  [mVs],  AjfvWj)’^’  =  3.87  x  10’’  [m’/s], 

A/vw,)’"’  =  1.62  X  10-^  [m’/s],  AJivw,  =  2.07x10’  [m’/s],  A^hv'V^  =  1.46  x  lO’’ 
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[mVs]  and  A^hw,^  -  1.67  x  10’  [mVs]  where  v  =  1.2  m/s.  The  value  of  h is  calculated  for 

the  largest  nozzle  of  length  22  mm  and  width  VK  =  1  mm  for  the  conditions  at  which  the 
assist  gas  attains  sonic  velocity  at  the  kerf  inlet.  For  this  situation  the  temperature  of  the  gas 
is  found  to  be  =  250  K  by  taking  the  Mach  number  =  1  in  Eq.  (9),  The  distance 
between  the  nozzle  and  workpiece  (H)  and  the  parameter  y  are  taken  as  6  mm  and  20  mm 


respectively in Eq. (5a). The average value of the heat transfer coefficient is $h = 1708$
W/m^2 K under these conditions. These magnitudes indicate that the term involving $A_3$ is
dominant in affecting the melt depth compared to the rest of the terms. Therefore, Eq. (13)
can be simplified as


The assist gas removes the melt to produce a slot whose depth is the cut depth in the case
of laser grooving. The cut depth is equal to the thickness of the workpiece for laser cutting
applications. The mass balance Eq. (2) relates the cut depth to the melt depth as given below.


(16) 


The splashing function $f_d$ is not a constant. It is zero when the assist gas pressure is zero
and progressively increases as the pressure increases. Since the gas pressure affects the
melt velocity $V_m$, the splashing function is defined in terms of $V_m$ as follows,

$$f_d = \frac{V_m}{\chi_d v + V_m} \qquad (17)$$

where $\chi_d$ is a suitable constant. Utilizing Eqs. (10), (15), (17) and setting $w_k = w$ in Eq.
(10), Eq. (16) can be written as
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(18) 


d  = 


^  dp^ 

ydXj 


w 


^W+12X,\iv 


TABLE I. Typical values of thermophysical properties of iron. Due to the lack of
availability of high temperature data, these values were used in this study to obtain various
results for the 400 series stainless steel workpiece [19].

Thermophysical property                          Value
Density, $\rho$                                  7870 kg m^-3
Melting point, $T_m$                             1809 K
Boiling point, $T_b$                             3133 K
Specific heat, $c_p$                             456 J kg^-1 K^-1
Thermal conductivity, $k$                        78.2 W m^-1 K^-1
Thermal diffusivity, $\alpha = k/(\rho c_p)$     2.18 x 10^-5 m^2 s^-1
Latent heat of melting, $L_m$                    2.72 x 10^5 J kg^-1
Latent heat of boiling, $L_b$                    6.10 x 10^6 J kg^-1
Dynamic viscosity, $\mu_0$                       0.3699 mN s m^-2
Activation energy, $E$                           41.4 kJ mol^-1
TABLE II. Thermophysical properties of nitrogen [13].

Thermophysical property                                 Value
Pressure, $P_a$                                         1.013 x 10^5 Pa
Density, $\rho_g$ (300 K, 1 atm)                        1.1233 kg m^-3
Kinematic viscosity, $\nu_g$ (250 K, 1 atm)             11.48 x 10^-6 m^2 s^-1
Prandtl number, Pr (250 K, 1 atm)                       0.727
Thermal conductivity, $k_g$ (250 K, 1 atm)              22.2 x 10^-3 W m^-1 K^-1

Model  for  kerf  width 

The width of the slot produced by melt removal with an assist gas, which is referred to as
the kerf width in laser cutting applications, is obtained by modifying the Rosenthal
equation for temperature distribution [20]. The Rosenthal model is based on a point heat
source, a semi-infinite workpiece, no heat losses due to convection and radiation, and no
melting of the workpiece. To account for the reduction in temperature due to the melting of
the workpiece and the removal of the superheated melt from the laser-material interaction
zone, the Rosenthal equation is modified by incorporating a factor $f_m$, $0 < f_m < 1$, as given
below.

vr 

PA  e  ..Q. 

T-To  = - -  (19) 

2Kk  r 

where $r = \sqrt{x^2 + y^2 + z^2}$. The melt width is approximated by determining the width of the
region at the substrate surface ($x = z = 0$) within which the temperature is above the melting
temperature. To obtain a simple expression for the melt width $w_m$, $r$ is set equal to $w_m/2$
in the denominator and is taken as $w/2$ in the exponent in Eq. (19), which reduces Eq. (19)
to the following form.


”  ^^nk[T^-T,)^  (20) 

Some of the molten material flows along the walls of the slot or kerf and most of it is
splashed out by the assist gas (see Fig. 2). This produces a slot or kerf of width $w_k$, which
is related to the melt width by the following mass balance equation.

$$\rho v w_m d_m \tau = \rho (V_m + v)(w_m - w_k) d_m \tau + f_w \rho v w_m d_m \tau \qquad (21)$$


FIG. 2. Top view of the work piece showing how assist gas has
squeezed metal along the walls of the kerf. (Labels: work piece; molten metal
flowing along the wall of the kerf.)


The first term on the right hand side of this equation corresponds to the amount of material
sticking along the kerf walls, while the second term represents the amount splashed out.
Here $f_w$ is a splashing function of a similar nature as in the case of the cut depth. It is zero
when the assist gas pressure is zero and increases progressively as the pressure increases.
As in Section II, $f_w$ is defined in terms of $V_m$ in the following way,

$$f_w = \frac{V_m}{\chi_w v + V_m} \qquad (22)$$

where $\chi_w$ is a suitable constant. Utilizing Eqs. (10), (22) and setting $w_k = w$ in Eq. (14),
the expression for the kerf width can be written as follows from Eq. (21).
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w,= 


ydXj 


W  +12X^\iv 


(23) 


Pressure  of  the  assist  gas  at  the  nozzle  mouth,  P  [MPa] 


FIG. 3. Plot of the Prandtl or Meyer relationship as a function of
assist gas pressure at the nozzle mouth. The upper and lower
curves indicate the velocity of the gas on the upstream and downstream
sides of the shock respectively.


Results  and  discussion 

The melt and cut depths and the kerf width are calculated by using the above models and
the data listed in Tables I and II. The results are compared with experimental data to verify
the models. Details of the experimental setup and procedure have been presented in Ref.
[1]. The values of the parameters $F$, $\chi_d$, $\chi_w$ and $f_m$ are chosen to be $F = 0.05$,
$\chi_d = \chi_w = 0.02$ and $f_m = 0.21$ to obtain trends similar to the experimental data. The physical
meaning of choosing $\chi_d = \chi_w = 0.02$ is that this choice leads to the maximum cut depth over a
wide range of cutting speeds for sonic gas velocity at the inlet of a kerf of uniform width.

Figure  3  is  a  plot  of  the  Prandtl  or  Meyer  relation  for  the  case  in  which  the  assist 
gas  attains  a  critical  speed  at  about  0.2  MPa  gas  pressure  at  the  nozzle  mouth  and  a  shock 
wave  is  formed  at  an  intermediate  location  between  the  nozzle  and  workpiece.  The  curve 
bifurcates  at  this  critical  speed.  The  upper  curve  represents  the  assist  gas  velocity  at  the 
upstream  of  the  shock  wave  and  the  lower  curve  denotes  the  velocity  downstream.  The 
velocity of the downstream gas is always less than the critical speed of the gas. Figure 3
indicates that the velocity with which the assist gas enters the kerf decreases if a shock front
is formed between the nozzle and workpiece. This reduces the effectiveness of the assist
gas in removing the molten material.


FIG. 4. Verification of the scaling law for cut depth with the experimental data.
(Horizontal axis: v*w [m/min]*mm. Legend: nozzles 1 mm x 22.9 mm, 0.6 mm x 14.5 mm
and 0.6 mm x 2.5 mm; $P_n$ = 0.9 MPa; model curves for A = 0.45 and A = 0.55.)

FIG. 5. Variation of cut depth with the assist gas velocity at
the nozzle exit for different scanning velocities.


Figure 4 shows the cut depth per unit power as a function of the product of the laser
beam width w and the scanning velocity v. Four different absorptivity values in the range of
0.35 to 0.55 are used. The theoretical curves closely follow the experimental data, which
validates the theory. Figure 5 represents the variation in the cut depth with the assist gas
velocity at the nozzle exit. The curves show zero cut depth for zero gas velocity, rise
sharply and then attain constant values. This can be explained on the basis of the kerf
geometry, which is assumed to be a nontapered through cut. Because of this, the velocity of the
assist gas remains equal to the critical value in the kerf even though the velocity of the gas
may be supersonic outside the kerf. The curves indicate that the maximum cut depth is
achieved at gas velocities much lower than the critical speed. At higher scanning speeds the
maximum cut depth is achieved at higher gas velocities, though still less than the critical
speed, because the gas jet interacts with the workpiece for a shorter duration. Figure 6
presents the corresponding results in terms of the gas pressure at the nozzle mouth and
essentially shows the same behavior as Fig. 5. Figure 7 shows the variation in the cut depth
with the scanning velocity. The cut depth drops sharply for higher scanning speeds
because the laser-material interaction time decreases and, therefore, the laser beam deposits less
energy on the workpiece when the beam is scanned at higher speeds.


FIG. 6. Variation of cut depth with pressure of assist gas at
the nozzle mouth at different scanning velocities.
(Horizontal axis: assist gas pressure at the nozzle mouth, P [MPa].)

FIG. 7. Trends showing cut depth varying with the scanning
velocity at different laser powers.
(Horizontal axis: scanning velocity, v [m/min].)


The kerf width per unit power ($w_k/P$) is plotted against the product of the laser
beam width w and the scanning velocity v in Fig. 8. Experimental data and model results
show similar trends. Figures 9 and 10 examine the kerf width as a function of the assist
gas velocity at the nozzle exit and the pressure at the nozzle mouth respectively. The kerf
width rises sharply as the gas velocity or pressure increases, and then attains a constant
value as the sonic velocity is approached, because of the kerf geometry already mentioned
above in the case of the cut depth (Fig. 5). Figure 11 shows that the kerf width decreases
as the scanning velocity increases because of less energy deposition on the workpiece at
higher scanning speeds.

Figure 12 examines the cutting speed as a function of the assist gas pressure at the
nozzle mouth, and the model results are compared with experimental data taken from
Ref. [21]. The results are shown for two cases involving 2 mm and 3 mm cut depths. The
experimental data indicate that the cutting speed increases as the gas pressure is increased
up to the critical pressure (0.2 MPa) and then drops on further increase of pressure. This
trend is predicted by the present model on the basis of the assumption that a shock wave is
formed at an intermediate location between the nozzle and workpiece for supersonic gas
flow. The flow is then governed by the Prandtl or Meyer relation and the velocity of the
gas at the kerf inlet is always less than the critical velocity. In the pressure range 0.1 < P_n <
P* = 0.2 MPa the velocity of the metal is calculated on the basis of M_n*, and is
determined by using M_j* for P_n > P*. It may be noted that the effective nozzle-to-plate
distance becomes H/2 for supersonic gas flow because the shock wave is assumed to form
halfway between the nozzle and workpiece. It is also assumed that the kerf is of nontapered
geometry and that the size of the laser spot is 0.25 mm x 0.25 mm. For this spot size the
associated model parameter is chosen to be 0.0088.

FIG. 8. Verification of scaling law for cut depth with
experimental data. (Axes: kerf width, w [mm], versus v*w [(m/min)*mm].)

FIG. 9. Variation of kerf width with the assist gas velocity
at nozzle exit for different scanning velocities. (Horizontal axis: assist gas velocity at nozzle
exit [m/s].)

FIG. 10. Variation of kerf width with the assist gas pressure
at the nozzle mouth for different scanning velocities. (Horizontal axis: pressure of the assist
gas at nozzle mouth [MPa].)

FIG. 11. Trends showing kerf width varying with the scanning
velocity at different laser powers. (Horizontal axis: laser scanning velocity, v [m/min].)


FIG.  12.  Experimental  data  taken  from  Ref.  [21]  is  compared 
with  the  theoretical  results  for  2mm  and  3mm  depths  of  cut, 
for  a  laser  spot  size  of  dimensions  0.25mm  X  0.25mm 

Conclusions 

Models  for  melt  and  cut  depths  and  kerf  width  are  presented  for  laser  grooving  and  cutting 
applications.  Experimental  data  for  both  cut  depth  and  kerf  width  validate  the  model 
predictions. The cutting speed reaches a maximum value at a certain pressure of the assist
gas. For a given kerf geometry and given nozzle dimensions, it decreases when the gas pressure
is increased beyond the critical pressure.

Abstract

The properties of nano-structured plastics are determined by complex relationships between
the type and size of the nano-reinforcement, the interface and chemical interaction between the nano-
reinforcement and the polymeric chain, along with macroscopic processing and microstructural
effects. Recently, families of functionalized polyhedral oligomeric silsesquioxane (POSS) macromers
have been developed bearing various polymerizable organic groups that can be chemically attached to
common polymers such as polystyrene to form random copolymers. These hybrids show many
interesting physical and thermal properties that differ from those of the host organic polymers.
However, due to the rigid nature of the inorganic component, it is often difficult to characterize the
desired properties using conventional thermal-physical techniques. In this study, we use techniques
such as positron annihilation lifetime spectroscopy (PALS) as a quantitative method to examine
thermal transitions and the nanoscopic structure of these hybrid copolymers. In addition to hybrid
thermoplastic polymers, we also examined the thermal and viscoelastic responses of thermoset epoxy
reinforced with epoxy-functionalized POSS. It was found that POSS cages provide topological
constraints that hinder the motion of crosslink junctions. Thus, we were able to improve the thermal
performance of epoxy with no additional difficulty in processing. Furthermore, high temperature
mechanical performance was also improved.

CHARACTERIZATION OF THERMOPLASTIC
INORGANIC-ORGANIC HYBRID POLYMERS


Andre  Lee 
Associate  Professor 

Department  of  Materials  Science  and  Mechanics 
Michigan  State  University 

INTRODUCTION 

The  design  of  new  materials  with  enhanced  properties  continues  to  be  a  driver  for  the 
investigation  of  hybrid  materials.  As  hybrid  materials  are  copolymers  based  on  inorganic  and  organic 
comonomers,  they  display  enhanced  properties  by  bridging  the  property  space  between  two 
dissimilar types of materials. A typical hybrid material will contain a crosslinked inorganic phase
covalently bound with an organic phase. Depending on the relative amounts of the two components, the
properties of the resulting hybrid are intermediate between those of an inorganic and an organic
polymer. Such methodology can be used to create either plastic inorganics or toughened plastics,
and is superior to traditional blending methods. However, most hybrid materials are obtained
through a so-called "sol-gel" type process, and this approach can only be used to create crosslinked
polymers. Therefore a more general approach to material chemistry is desired.

Polyhedral Oligomeric Silsesquioxane (POSS) reagents, monomers and polymers represent
such an emerging technology, serving as a new chemical feedstock for the preparation of
nano-reinforced hybrid plastics. The establishment of broad chemical trees of monomeric POSS
reagents and resins based on well-defined POSS cages affords new chemical feedstocks for the
modification of properties in nearly all traditional organic polymeric materials.

POSS reagents combine a hybrid inorganic-organic composition with nano-sized cage
structures having dimensions comparable to those of most polymeric segments or coils. Hence
incorporation of POSS reagents into linear thermoplastic or thermoset networks can be used to
modify the composition, local structure, and chain mobility in polymeric systems. These
modifications can ultimately affect the thermal, oxidative, and dimensional stability of many
polymeric resins, thus upgrading their properties for numerous high performance engineering
applications. The current difficulty for further advancement of these materials is the concern about
their processing into engineering structures. In this study, we examine two different approaches:
(1) melt processing of thermoplastic type hybrid materials, and (2) a thermal casting process for
thermoset, high crosslink density, high performance hybrids.

BACKGROUND 

Positron  Annihilation  Lifetime  Spectroscopy 

The successful application of positron annihilation lifetime spectroscopy (PALS) as a
quantitative method for studying the structural characteristics of polymeric materials depends on an
understanding of the behavior of the positron injected into the substance. It has been found that in all
known polymers the positron exists as a free positron or forms a bound state with an electron from
the medium, which is then known as positronium (Ps). The typical lifetime of the free positron is in the
range 0.3 to 0.5 nanoseconds, while the lifetime of Ps depends on the spin state. The singlet state, or
parapositronium (p-Ps), annihilates in 0.1 to 0.2 nanoseconds, and investigations show that its
lifetime, τ1, as well as the e+ lifetime, τ2, are not sensitive to temperature and structural changes. The
triplet state, or orthopositronium (o-Ps), has an intrinsic vacuum lifetime of about 140 nanoseconds,
which in amorphous polymers is reduced to τ3 of about 1.5 to 3 nanoseconds by "pick-off" annihilation
with an electron from the surrounding molecules. The fraction of the positrons forming o-Ps (I3) and
τ3 both depend on the temperature and structural changes in the host glass. It is widely accepted that
o-Ps exhibits a strong tendency to localize in microvoids or small holes, where the electron density is
lower than that in the bulk, and it has been found that τ3 increases with an increase in the hole
volume.
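
The qualitative statement that τ3 grows with hole volume is often made quantitative in the PALS literature with the semi-empirical Tao-Eldrup relation for a spherical hole of radius R: τ3 = 0.5 ns x [1 - R/(R + ΔR) + sin(2πR/(R + ΔR))/(2π)]^-1. This report does not say which relation, if any, was applied, so the Python sketch below is only an illustration under that assumption. The electron-layer thickness ΔR = 0.1656 nm is a commonly quoted literature value, not a number from this study, and the function names are hypothetical.

```python
import math

# Assumed empirical electron-layer thickness in nm (not taken from this report).
DELTA_R_NM = 0.1656


def ops_lifetime_ns(radius_nm):
    """o-Ps pick-off lifetime (ns) for a spherical hole, Tao-Eldrup form."""
    x = radius_nm / (radius_nm + DELTA_R_NM)
    return 0.5 / (1.0 - x + math.sin(2.0 * math.pi * x) / (2.0 * math.pi))


def hole_radius_nm(tau3_ns, lo=0.01, hi=2.0, tol=1e-10):
    """Invert ops_lifetime_ns by bisection; the lifetime rises monotonically with radius."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ops_lifetime_ns(mid) < tau3_ns:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Illustrative input only: a lifetime inside the 1.5 to 3 ns range quoted above.
radius = hole_radius_nm(2.0)
volume_nm3 = 4.0 / 3.0 * math.pi * radius ** 3
```

Under this model a longer τ3 maps to a larger radius and hence a larger hole volume, which is the trend described in the text.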

The positronium formation process in polymeric solids is not well understood. The most
popular models, i.e., the Ore and spur models, use completely different approaches to explain the
process of o-Ps formation, and there is no clear evidence in favour of one or the other at the present
time. The experimental results show that in porous resins and other porous materials I3 is
proportional to the total surface area of the pores. It is assumed that in glassy polymers I3 is
correlated with the density of holes in the material, but the exact nature of this correlation is
not known.

Viscoelastic  Responses  and  Physical  Aging 

Amorphous polymers and the amorphous regions of crystalline polymers undergo a process
known as physical aging. For all glassy materials, when cooled from a temperature T0 above the glass
transition temperature, Tg, to a temperature Ta below Tg, the material exhibits a slow but significant
evolution in its mechanical properties. Since no chemical degradation of the material can be invoked,
such evolution of properties has come to be called "physical aging". The occurrence of physical aging
has been attributed to the non-equilibrium state of the glass caused when the amorphous polymer is
quenched from above to below its Tg. The non-equilibrium nature of the glass makes it
thermodynamically unstable. As a result, thermodynamic properties such as specific volume or
enthalpy decrease with the aging, or annealing, time in the glassy state as the polymer molecules strive
to attain their equilibrium packing and conformation. The changes in mechanical performance are
attributed to the changes in the thermodynamic properties. The phenomena reported most
extensively in the literature are the gradual increase of the isochronal viscoelastic
modulus due to shifts in the relaxation response as aging proceeds, the reduction in the creep rate,
the increase of strength under tension or multi-axial deformation, and, to a lesser extent, the decrease
of the impact fracture energy of the aged materials.

In the "free-volume" theory of the glass transition, the material relaxation time, τ, is primarily
determined by the free volume, vf, and not by temperature directly. As the material is quenched from
above to below Tg, to a first approximation, the value of vf at Tg is frozen in and vf no longer decreases
as the temperature is further lowered to the aging temperature Ta. Hence the modulus of the polymer in
the glassy state remains constant as a function of temperature. However, at Ta, due to the effect of
physical aging, the modulus slowly increases with time as the specific volume slowly decreases. The
interpretation given by the free-volume theory is that physical aging is related to the slow relaxation of
free volume towards its equilibrium. The time required for free volume to reach equilibrium and the
rate of aging are related to the excess amount of free volume and its distribution, respectively.

As mentioned above, physical aging affects the viscoelastic response of the polymer by
shifting the mechanical, dielectric, etc. relaxation (or retardation) curves to longer times. The amount
of shift is a_te, and the aging rate is generally defined by the double logarithmic shift rate μ:

$$\mu = \frac{d \log a_{t_e}}{d \log t_e}$$

where t_e is the aging time. As a glassy polymer approaches structural equilibrium, the viscoelastic
responses cease to shift to longer times. This condition is therefore referred to as the "structural
equilibrium state". The time to reach this equilibrium may be controlled by the molecular nature of the
glassy structure. With the addition of POSS cages, we examine the impact of nano-clusters on the
overall relaxation of polymeric glasses.
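
As a rough illustration of how the double logarithmic shift rate can be extracted from a set of shift factors, the Python sketch below fits a least-squares slope to log a_te versus log t_e. The aging times mirror the doubling sequence used later in the experiments, but the shift factors are synthetic values generated from an assumed power law, and the function name is hypothetical; none of these numbers come from the measurements reported here.

```python
import math


def shift_rate(aging_times, shift_factors):
    """Least-squares slope of log10(a_te) versus log10(t_e)."""
    xs = [math.log10(t) for t in aging_times]
    ys = [math.log10(a) for a in shift_factors]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic illustration: shift factors following a_te = (t_e / 0.5) ** 0.8,
# referenced to the shortest aging time (0.5 hour).
aging_hours = [0.5, 1, 2, 4, 8, 16, 32, 64]
factors = [(t / 0.5) ** 0.8 for t in aging_hours]
mu = shift_rate(aging_hours, factors)
```

A slope that falls toward zero at long aging times would correspond to the approach to structural equilibrium described above.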

EXPERIMENTAL 

Styryl POSS Macromers and Copolymers

Four POSS macromers were synthesized, each containing a single styrene group and having the
formula P1R7Si8O12, where P is the styrene group and R is either cyclohexyl (c-C6H11) or
cyclopentyl (c-C5H9). These macromers were synthesized in the same manner from the known
trisilanols R7Si7O9(OH)3 and the appropriate trichlorosilane PSiCl3. The purity of the POSS-styryl
macromers was verified using proton NMR. Various amounts of POSS macromers were used to
produce random copolymers with different mole fractions of POSS segments. The random
copolymers were synthesized using the addition polymerization technique with AIBN as initiator.
The reaction time was 64 hours at 60°C. The molecular weight was characterized using gel
permeation chromatography and a combination of refractive index and multi-angle laser light
scattering measurements with a Wyatt Technologies DAWN spectrometer.

Epoxide  POSS  Macromers  and  Epoxy  Networks 

Meta-chloroperbenzoic acid (MCPBA) was added to a solution of
(c-C6H11)7Si8O12CH2CH=CH2 in methylene chloride and stirred at 23°C for 3 days. The methylene
chloride was removed from the reaction mixture by rotary evaporation and the resulting solid was
placed into methanol and stirred for 24 hours. The POSS-epoxide product, which is insoluble in
methanol, was collected by vacuum filtration, washed with methanol and dried under vacuum to give
the mono-functional POSS-epoxide. The product was verified by 1H NMR.

The nano-reinforced epoxy network glasses used in this study contain up to 9 weight percent of the
POSS macromer described above. The conventional epoxy resins used in this study were two
different di-functional epoxies (diglycidyl ether of bisphenol A, DGEBA; Dow Chemical D.E.R. 332,
and 1,4-butanediol diglycidyl ether, BDGE; Shell Chemical Heloxy 67). The curing agent used was
a diamine-terminated polypropylene oxide, Huntsman Chemical Jeffamine D230. It is important
to note here that the molar ratio of epoxide (the contribution from DER332 and Heloxy 67
combined) to amine hydrogen (from Jeffamine D230) was equal to one for all epoxy networks
investigated in this study. The DGEBA epoxide monomer was first heated at 60°C for two hours to
melt any crystals present. The POSS-epoxide, Heloxy 67 and D230 were then mixed together and
stirred by hand. The mixture was heated at 60°C for 3 hours to promote the reaction between the
amine and epoxide functionalities. The preheated DER332 was then added to the mixture and stirred
by hand until clear. The mixture was then degassed in a vacuum for 10 minutes at room temperature.
This resin mixture was then poured into a glass tube with a diameter of 12.7 mm and cured in an air
oven at 100°C for 4 hours, then slowly cooled in the oven overnight to room temperature.
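
The one-to-one ratio of epoxide groups to amine hydrogens can be turned into a batch calculation using equivalent weights. The Python sketch below is a minimal illustration of that bookkeeping, counting only the two difunctional epoxies as the text specifies. The equivalent weights and masses are hypothetical placeholders, not datasheet or batch values from this study, and the function name is an assumption.

```python
def amine_mass_for_unit_ratio(epoxy_masses_g, epoxy_eq_weights, amine_h_eq_weight):
    """Mass of curing agent (g) that gives epoxide : amine hydrogen = 1.

    epoxy_eq_weights are grams of resin per mole of epoxide groups;
    amine_h_eq_weight is grams of curing agent per mole of amine hydrogens.
    """
    epoxide_equivalents = sum(
        mass / eq_weight for mass, eq_weight in zip(epoxy_masses_g, epoxy_eq_weights)
    )
    return epoxide_equivalents * amine_h_eq_weight


# Placeholder inputs for illustration only (hypothetical values).
masses_g = [70.0, 26.0]          # DGEBA-type resin, BDGE-type resin
eq_weights = [175.0, 130.0]      # assumed epoxide equivalent weights, g/eq
d230_mass_g = amine_mass_for_unit_ratio(masses_g, eq_weights, 60.0)
```

With these placeholder numbers the two resins contribute 0.4 and 0.2 epoxide equivalents, so the sketch returns the mass of curing agent carrying 0.6 equivalents of amine hydrogen.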

Positron Annihilation Lifetime Spectroscopy

The PALS measurements of the series of specimens were performed by using a conventional
fast-timing coincidence method in a temperature-controlled chamber at a constant temperature of
25±1°C. The thermal response was investigated by heating the material up to 480 K. Cooling
was provided by the addition of LN2. All spectra were collected at a constant temperature with an
acquisition time of 3 hours. Due to the long measurement time needed for a thermal scan (about 84
hours), the sample was first saturated with positron radiation for 6 hours to allow proper measurement
of I3. The positron source [~1x10^6 Bq (~30 µCi) of 22Na] was deposited on the surface of an annealed
single-crystal copper disc. As a result, I3 measured herein is approximately half of that determined
with the source sandwiched between two samples. The spectra were acquired and stored in a
MicroVAX-based multichannel analyzer. The lifetime spectra were resolved into three components
by using the PFPOSFIT program.

Differential  Scanning  Calorimetry 

Heat flow and glass transition measurements were performed with a TA Instruments 2920
differential scanning calorimeter. This calorimeter is based on the modern modulation technique.
By applying a sine wave on top of the heating ramp (an analytical Fourier transformation method), one
can separate reversible and non-reversible heat flow. Any thermal transition such as the glass transition
appears as a slope discontinuity on the non-reversible heat flow trace. This calorimeter was used in
this study to resolve the glass transition for polymers containing a high mole fraction of POSS segments,
which cannot be examined using conventional DSC. The sample was heated from 50°C to 250°C
at a rate of 5°C/minute, with a modulation of 1 Hz and an amplitude of ±1°C. The glass transition
temperature was determined as the inflection point of the glass transition region on the DSC
thermograms.

Small-Strain  Stress  Relaxation 

Mechanical  tests  were  performed  using  a  computer-controlled  servo-hydraulic  testing  machine 
(Instron  model  1321)  equipped  with  an  oven  for  temperature  control.  The  temperature 
measurements  taken  at  the  top  and  bottom  surfaces  of  the  samples  showed  a  gradient  of  less  than 
0.3°C. Oven stability was observed to be better than ±0.2°C over a 3-day period. The tensile test
specimens were first annealed for 30 minutes at 105°C (at least 20°C above the DSC-determined Tg) to
erase any aging that occurred during storage. They were then placed in the testing machine at the
testing temperature and monitored as they aged.

All mechanical tests were carried out in uniaxial extension under stress relaxation conditions.
A small strain was applied periodically at aging times, t_e, that doubled with each test; i.e., t_e = 30
minutes, 60 minutes, 120 minutes, etc. The duration, t_i, of the deformation was varied so that the ratio
t_i/t_e was constant throughout the aging experiment. The value of t_i/t_e in this study was 0.05. A short
duration was used so that the changes that occurred in the polymer structure during the mechanical
experiment would be minimal and hence would not influence the measurements. The small-strain
stress relaxation technique was used to examine the effect of POSS cages on the viscoelastic response of
polymeric glasses during physical aging. The applied strain used here was 0.001 and was measured
with an extensometer. The load was measured with a 1000 lb capacity load cell.


RESULTS  AND  DISCUSSION 

Linear Thermoplastic Styryl-POSS Copolymers

Differential  Scanning  Calorimetry 

Calorimetric measurements were performed for all styryl-POSS copolymers with both
cyclohexyl- and cyclopentyl-substituted POSS cages. Previously, it was reported that there was no
DSC-observable glass transition for copolymers with a high mole fraction of POSS segments. However,
using the modulated DSC technique it was clear that one can determine the glass transition temperature
even for the POSS homopolymer (see Figure 2). Therefore, we can obtain Tg without difficulty for
copolymers with 16 mole percent of POSS segments. All DSC traces are depicted in Figures 2 - 7.
The glass transition temperatures for all specimens studied are depicted in Figure 8. The following
observations were made:

1. In general, as the amount of POSS segments in the copolymer increases, the glass transition
temperature increases. Moreover, the transition remains sharp, which indicates that the copolymer
was not phase separated on the length scale of the DSC experiment.

2. From Figure 8, we suggest that cyclohexyl-substituted POSS cages had a more pronounced
effect on the glass transition temperature. However, the molecular details of why cyclohexyl
is more effective than cyclopentyl are still not clear at this time.

PALS  results 

In Figure 9, we depict the value of τ3 at room temperature (25°C) for POSS copolymers with
varying mole fraction of POSS segments. As mentioned above, the value of τ3 directly reflects the
size of the free volume holes. As we increase the mole fraction of POSS segments in the copolymer, the
value of τ3 increases. Hence the results in Figure 9 suggest that the presence of POSS cages hinders the
packing of polymeric coils.

In Figure 10, we depict the value of τ3 as a function of temperature. Similar to the DSC
experiments, we observed a change in slope at a temperature near the glass transition temperature
reported by DSC. It is interesting to point out here that the physics behind these two
experimental methods is completely different: DSC is based on the difference in heat flow above and
below the glass transition, while the value of τ3 is related to the size of the holes. For samples with a
higher mole fraction of POSS segments, it became more difficult to determine the exact value of the
glass transition temperature by DSC. However, this difficulty is not present for the PALS measurements.
In Figures 11 and 12, we show τ3 versus temperature for samples with 16 mole percent of POSS
segments. The change in slope can easily be determined.
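
One simple way to locate such a change in slope is to fit two straight lines to the low- and high-temperature portions of the data and take their intersection. The report does not describe the procedure actually used, so the Python sketch below is only an assumed approach: it tries every split point, keeps the pair of least-squares lines with the smallest total residual, and returns the crossing temperature. The function names are hypothetical and the data are synthetic, built with a kink placed at 380 purely for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept, sum of squared residuals)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    sse = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse


def slope_change_temperature(temps, values, min_points=3):
    """Intersection of the best two-segment linear fit (assumes the two slopes differ)."""
    best = None
    for k in range(min_points, len(temps) - min_points + 1):
        low = fit_line(temps[:k], values[:k])
        high = fit_line(temps[k:], values[k:])
        total = low[2] + high[2]
        if best is None or total < best[0]:
            best = (total, low, high)
    _, (s1, b1, _), (s2, b2, _) = best
    return (b2 - b1) / (s1 - s2)


# Synthetic lifetime data (ns) with a slope change placed at a temperature of 380.
temps = list(range(300, 470, 10))
tau3_ns = [1.80 + 0.001 * (t - 300) if t <= 380 else 1.88 + 0.004 * (t - 380) for t in temps]
tg_estimate = slope_change_temperature(temps, tau3_ns)
```

With real, noisy spectra the split search would benefit from more points per segment, and the estimate should be read together with the fitting uncertainty of each line.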

Another piece of information that can be obtained using PALS is the number of holes, which
is represented by the value of I3. In Figure 13, we plot I3 versus temperature for the 8 mole percent
POSS-MPS copolymers. Here, we observed another change of slope at a temperature near 200 K. The
molecular origin of this change is not clear. It is possibly related to the so-called β transition in
polystyrene.

Crosslinked  Thermoset  POSS-Epoxy 
DSC  study 

Calorimetric measurements were performed for network glasses containing 5 and 9 weight
percent of the mono-functional POSS-epoxy nano-reinforcement. Measurement of the glass transition is
routinely used as part of resinous material characterization and provides direct insight into the mobility
of polymer chains and junctions. Although we use only mono-functional POSS-epoxy here, the size of
the POSS cages means that the mobility of the network junctions was hindered. Thus, as the weight
fraction of POSS cages increases, the glass transition temperature (mid-point of the specific heat
transition) also increases. Unlike in the linear thermoplastic copolymers, the onset temperature of the
glass transition was not observed to change. This suggests that the presence of POSS cages within the
network slows down the transition from glassy-like behavior to rubbery-like behavior. Hence the
material maintains its modulus longer at elevated temperatures. The onset temperature, however, is
principally governed by the difunctional epoxy-amine junctions and is therefore not affected by the
presence of mono-functional POSS cages that are chemically attached to the network.

Isothermal Aging - Stress Relaxation Experiments

The relaxation modulus curves displayed in Figure 17 are representative of the results obtained
with the epoxy network glass aged at a temperature below the glass transition temperature. They
correspond to aging times of 0.5 hour, 1 hour, 2 hours, and up to 64 hours, as indicated. In general,
for a given sample, it is always possible to shift the relaxation curves obtained at different aging times
horizontally along the time axis to form a time-aging time master curve, as shown in Figure 17. To
compare the relaxation modulus curves obtained at different temperatures and between the various
epoxy compositions, the relaxation modulus curve was fitted with a stretched exponential
Kohlrausch-Williams-Watts (KWW) type function:

$$E(t) = E_0 \exp\left[-(t/\tau)^{\beta}\right]$$

where E0 is the zero-time tensile modulus, τ is the characteristic relaxation time, and β is the shape
parameter related to the breadth of the stress relaxation modulus curve. The numerical analysis for
the curve fits at each experimental condition was performed using KaleidaGraph non-linear least-
square data analysis software on a personal computer. We found that the values of β and E0 remained
constant despite the differing amounts of mono-functional POSS-epoxy that had been added into the
glassy network. In contrast, the breadth of the glass transition was observed to increase with higher
loading of POSS-epoxy in the network. Furthermore, it is interesting to note that when tested at
different aging temperatures, the value of β was not affected by the amount of POSS-epoxy present.
In Figure 18, we depict the relaxation modulus curves for different loadings of POSS-epoxy
monomers after 64 hours of isothermal aging at a temperature of 63.9°C. It is clear that these
relaxation curves can be superimposed with only a horizontal shift along the time axis. Furthermore, it
is interesting to point out that the value of E0 is not affected by the presence of the inorganic nano-
reinforcements. This may be in part a reflection of the mono-functional nature of the POSS-epoxy used
in this study. Despite the ability of the POSS cages to hinder the relaxation motion of network
junctions from a chain terminus location within the network, they do not contribute to the overall
deformation process of such glassy networks from this position. Interestingly, such mono-functional
POSS-epoxide may be useful for enhancing the glass transition without increasing crosslinking density
and potentially detracting from the desirable mechanical properties of such epoxy networks.
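
To make the role of the stretched-exponential parameters concrete, the Python sketch below recovers the characteristic relaxation time and the shape parameter from synthetic relaxation data. It is not the non-linear least-squares procedure described above: as a simpler stand-in it assumes the zero-time modulus is already known and uses the linearized form ln(-ln(E/E0)) = β ln t - β ln τ, fitted with an ordinary least-squares line. All numerical values and function names are hypothetical.

```python
import math


def kww_modulus(t, e0, tau, beta):
    """Stretched-exponential relaxation modulus E(t) = E0 * exp(-(t / tau) ** beta)."""
    return e0 * math.exp(-((t / tau) ** beta))


def fit_kww(times, moduli, e0):
    """Estimate (tau, beta) from the linearized KWW form, with E0 taken as known."""
    xs = [math.log(t) for t in times]
    ys = [math.log(-math.log(e / e0)) for e in moduli]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    beta = sxy / sxx
    intercept = y_mean - beta * x_mean      # equals -beta * ln(tau)
    tau = math.exp(-intercept / beta)
    return tau, beta


# Synthetic data generated from assumed parameters (illustration only).
times_s = [1.0, 3.0, 10.0, 30.0, 100.0, 300.0, 1000.0]
moduli = [kww_modulus(t, 3.0, 5000.0, 0.4) for t in times_s]
tau_fit, beta_fit = fit_kww(times_s, moduli, 3.0)
```

In this picture, curves that share the same β and E0 differ only in τ, which is why they can be superimposed by a horizontal shift along the logarithmic time axis.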

The characteristic relaxation time, τ, of the networks was also obtained using the above
KWW function from the data of the stress relaxation experiments. From Figure 17, we show that
as the isothermal aging time increases the value of τ also increases. This is a common phenomenon
of glass-forming materials in the nonequilibrium glassy state. As the material gradually relaxes into its
equilibrium state, the molecular mobility is reduced. Hence, the characteristic relaxation time is
expected to increase with increased isothermal aging time until the material reaches its structural
equilibrium state. Therefore, the rate of change in τ is used as a measure of how a glassy system
approaches its structural equilibrium state. In Figure 19, we constructed a double-logarithmic plot of
τ versus aging time at a given test temperature for the epoxy glasses containing different loadings of the
POSS nano-reinforcements. Similar plots at different testing temperatures are shown in Figures 20
and 21. For aging temperatures well below Tg, the double logarithmic rate of change in the
characteristic relaxation times does not appear to be affected by the presence of POSS cages.
However, at any given aging time, the networks containing higher loadings of the POSS nano-
reinforcements have higher characteristic relaxation time values. Increased relaxation time is also to
be expected given the fact that the POSS-reinforced network also shows higher values of Tg.

To examine the effect of POSS cages on the molecular mobility, it is necessary for the systems to
be in the same thermodynamic state, i.e., the difference between the testing temperature and the Tg is the
same. In Figure 22, we show a double logarithmic plot of characteristic relaxation time versus aging
time for stress relaxation experiments performed at a temperature 9°C below the respective Tg. For the
neat resin sample, we observed a change in the slope of characteristic relaxation time versus aging time
after 32 hours. This is an indication that the sample is close to its final structural equilibrium.
However, for the sample with 9 weight percent of POSS-epoxy, the relaxation modulus curve continued
to shift to longer times, i.e., there was no change in the slope of characteristic relaxation time versus aging time.
After 64 hours of isothermal aging, the sample had not yet reached its structural equilibrium point.
From this observation, we propose that the presence of POSS cages provides topological constraints
on the network junctions which prevent or severely retard the process by which the network can reach
structural equilibrium. Given that the physical size of the POSS cages approaches that of most
polymer dimensions, the inability of such a network to relax can be anticipated. Therefore, the time
needed to reach structural equilibrium is longer for samples with POSS cages attached to the network
chains.
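
The double logarithmic rate of change discussed above is the slope of log(relaxation time) against log(aging time). The following is a minimal Python sketch of how such a slope could be estimated by least squares. The function name `aging_shift_rate` and all numerical values are illustrative assumptions; the data are synthetic and are not measurements from this study.

```python
import math

def aging_shift_rate(aging_times_h, relaxation_times_s):
    """Least-squares slope of log10(tau) versus log10(aging time)."""
    xs = [math.log10(t) for t in aging_times_h]
    ys = [math.log10(tau) for tau in relaxation_times_s]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return sxy / sxx

# Synthetic (made-up) data that follow tau = 10 * t_e**0.8 exactly
aging_times = [1, 2, 4, 8, 16, 32, 64]
taus = [10.0 * t ** 0.8 for t in aging_times]
rate = aging_shift_rate(aging_times, taus)
```

Applied separately to the early and late portions of a measured curve, a drop in this slope at long aging times would correspond to the approach to structural equilibrium described for the neat resin.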

CONCLUSIONS 

The synthesis of soluble, thermoplastic hybrid materials from well-defined inorganic clusters
containing only a single polymerization site has been developed. Styryl-based POSS derivatives were
successfully copolymerized with methylstyrene. Detailed thermal analysis was carried out using a modern
DSC technique. We were able to determine the glass transition temperature for those copolymers
with high loadings of POSS segments. It was determined that these copolymers form single-phase systems
and that copolymers with more POSS segments have higher values of Tg.

Using the PALS technique, the glass transition temperature of the styryl-based POSS copolymers was
easily determined. This is due to the different physics involved in the measurement. The results show
that the average hole size, as determined by PALS, increases with POSS content. This suggests that the
presence of POSS cages prevents the formation of random polymer coils. The details of the POSS-based
copolymers are to be investigated further using the small angle neutron scattering (SANS) technique.

Commonly used DGEBA epoxy resins were formulated with mono-functional POSS-epoxy
nano-reinforcements and subsequently cured into structural thermoset networks with
amine-terminated agents. DSC results show that the value of Tg increases with increased loading of the
mono-functional POSS-epoxy in the network. This increase in Tg without an increase in the degree of
crosslinking within the network may be of technological importance, and is to be investigated in the
future.

The networks were subjected to thermal quenching and aging experiments at temperatures below
Tg. At a constant temperature below the Tg of the system, it is possible to age these glassy networks
to the point where they reach structural equilibrium. However, the topological constraints provided
by the presence of the POSS reinforcements slow down the motion of the network junctions, and hence
the time needed to reach structural equilibrium is dramatically increased relative to that for
non-reinforced networks.
Figure Captions

Figure 1. Schematic of Styryl-POSS/Methylstyrene Copolymers

Figure 2. Reversible heat flow trace and heat capacity trace obtained using modulated DSC for
homopolymer of Styryl-POSS with cyclohexyl group. A Tg of 379.2°C is depicted.

Figure 3. Heat capacity trace obtained using modulated DSC for homopolymer of Styryl-POSS
with cyclopentyl group. A Tg of 372.0°C is depicted.

Figure 4. Reversible heat flow trace and heat capacity trace obtained using modulated DSC for
copolymer of Styryl-POSS/MPS with 4 mole percent of POSS segment. A Tg of
109.0°C is depicted.

Figure 5. Reversible heat flow trace and heat capacity trace obtained using modulated DSC for
copolymer of Styryl-POSS/MPS with 8 mole percent of POSS segment. A Tg of
118.9°C is depicted.

Figure 6. Reversible heat flow trace and heat capacity trace obtained using modulated DSC for
copolymer of Styryl-POSS/MPS with 16 mole percent of POSS segment. A Tg of
134.5°C is depicted.

Figure 7. Reversible heat flow trace and heat capacity trace obtained using modulated DSC for
copolymer of Styryl-POSS(cyclopentyl)/MPS with 16 mole percent of POSS segment.
A Tg of 112.4°C is depicted.

Figure 8. Tg of Styryl-POSS/MPS copolymers versus POSS segment content as obtained using
DSC. Filled symbols represent cyclopentyl system and open symbols represent
cyclohexyl system.

Figure 9. τ3 (average size of holes) as measured using PALS at room temperature for
styryl-POSS/MPS copolymers versus POSS segment content. A blend is cyclohexyl system
and B blend is cyclopentyl system.

Figure 10. τ3 (average size of holes) as a function of temperature of styryl-POSS/MPS
copolymers with 8 mole percent of POSS segments. Square symbols represent
cyclopentyl system and circle symbols represent cyclohexyl system.

Figure 11. τ3 (average size of holes) as a function of temperature of
styryl-POSS(cyclohexyl)/MPS copolymers with 16 mole percent of POSS segments.

Figure 12. τ3 (average size of holes) as a function of temperature of
styryl-POSS(cyclopentyl)/MPS copolymers with 16 mole percent of POSS segments.

Figure 13. I3 (number of holes) as a function of temperature of styryl-POSS/MPS copolymers
with 8 mole percent of POSS segments. Square symbols represent cyclopentyl
system and circle symbols represent cyclohexyl system.

Figure 14. PHVF (τ3 * I3, or total hole volume) as a function of temperature for
styryl-POSS(cyclohexyl)/MPS copolymers.

Figure 15. PHVF (τ3 * I3, or total hole volume) as a function of temperature for
styryl-POSS(cyclopentyl)/MPS copolymers.

Figure 16. Schematic of curing cycle of POSS-epoxy system.

Figure 17. Small-strain stress relaxation modulus curves for DGEBA/D230/Heloxy67/9 wt%
POSS-Epoxy glass at aging temperature of 63.9°C. Master curves offset by an
arbitrary shift for clarity.

Figure 18. Small-strain stress relaxation modulus curves for DGEBA/D230/Heloxy67/x wt%
POSS-Epoxy glass after 64 hours of isothermal aging at a temperature of 63.9°C.

Figure 19. Characteristic relaxation time versus aging time for DGEBA/D230/Heloxy67/x wt%
POSS-Epoxy glasses at a testing temperature of 54.1°C.

Figure 20. Characteristic relaxation time versus aging time for DGEBA/D230/Heloxy67/x wt%
POSS-Epoxy glasses at a testing temperature of 58.9°C.

Figure 21. Characteristic relaxation time versus aging time for DGEBA/D230/Heloxy67/x wt%
POSS-Epoxy glasses at a testing temperature of 63.9°C.

Figure 22. Characteristic relaxation time versus aging time for DGEBA/D230/Heloxy67/x wt%
POSS-Epoxy glass at a testing temperature 9°C below its respective Tg.

[Figure: modulated DSC trace. Sample size: 4.6000 mg. Method: modulated DSC, 5°C/min to 250°C. Comment: N2 purge, 50 mL/min. Operator: A. Lee.]

[Figure: stress relaxation plot, axis label "Relaxation Modulus (GPa)".]

IMPROVEMENT IN FRACTURE PROPAGATION MODELING
FOR  STRUCTURAL  BALLISTIC  RISK  ASSESSMENT 


Feng-Bao  Lin 
Associate  Professor 

Department  of  Civil  and  Environmental  Engineering 
Polytechnic  University 

Abstract 

A numerical simulation of the firing of a solid propellant rocket motor has been performed. The emphasis is on the
fracture modeling of the motor. A two-dimensional model was analyzed using a finite element program equipped with
the nonlocal smeared cracking model to simulate the fracture behavior. Solid propellants are inhomogeneous materials,
and the influence of the fracture process zone needs to be considered. The geometrical data for the finite element model
were obtained from previous studies conducted by Thiokol Corporation. The numerical results of this study agree with
Thiokol's test measurements. Debonding is not considered in this study. This nonlocal smeared cracking model may
be included in the FEINT program to improve Thiokol's three-dimensional predictions of motor firing.

Introduction 

The firing of a solid rocket propellant motor is a complex process, and its risk assessment requires an integration of
advanced technologies in diverse disciplines. A numerical simulation of a firing involves burnback, internal ballistics,
and structural analysis, in which one has to consider thermal strain, nonlinear constitutive relationships, large deformation,
rate effects, interface modeling, debond and crack propagation, and void growth, as well as the interactions between them.
One essential aspect of ballistic risk assessment is that cracks and voids in a propellant grain, and debonds between
propellant and insulator, can exist in a motor and could constitute a risk to its successful firing. These inherent
anomalies could affect the performance of the motor, increase the bore pressure, act as stress raisers, and thus
influence the structural response. In order to predict, from a structural point of view, whether it is safe to fire a motor
that contains such defects, fracture mechanics modeling is a required component of the motor risk assessment
system.

Thiokol Corporation, under contract with Phillips Laboratory, conducted a very thorough study on the ballistic risk
assessment of solid rocket propellant motors between June 1988 and May 1995 [Graham, et al, 1995]. The result of
that study is an integrated computer software package named Structural/Ballistic Analysis System (SBAS). This program
includes structural analysis codes: TEXLESP-2D and TEXLESP-3D [Collingwood, et al, 1988]; burnback analysis
codes: RECESS and SURF; internal ballistics analysis codes: RECESS, ITTIB, and VOLFIL; crack/debond combustion
codes: CCM and CDCA; and heat transfer codes: HEAT, ASCHAR, and SINDA. Furthermore, interfaces are provided
for the following general commercial structural analysis codes: ABAQUS, ANSYS, NASTRAN, and TEXPAC. Any
of these commercial codes can be selected to perform a structural analysis instead of using TEXLESP. The executive
module for SBAS is the FEINT program. FEINT provides preprocessing (geometry definition, mesh generation,
boundary conditions, material properties), analysis execution, and postprocessing (deformed geometry plots, contour
plots, vector plots, XY plots). FEINT also provides interfaces between the various codes mentioned above.

One of the primary goals of Thiokol's study was to gain a better qualitative and quantitative understanding of the
influence of defects on the structural response of solid rocket propellant motors. This would provide a means of
assessing the effect of motor defects on a static test firing and provide the test engineers with a rational evaluation of
the risks of firing the motor, so that actions can be taken to minimize damage to expensive test facilities.

Thirteen static firing tests of specially designed subscale motors were carried out in Thiokol's program. To
check the accuracy of the numerical simulations, Thiokol used the FEINT program to predict each motor's performance
before the firing test. The test measurements included pressure, strain, and deformation. The pressure measurements
and the real-time radiography (RTR) observations together provided the best indication of the growth of defects. In their
study, the numerical predictions of the crack criticality were accurate, but debond and crack initiation and propagation
speeds were generally overpredicted. It can be concluded that defects are critical to the performance of solid rocket
propellant motors, but considerable discrepancies between numerical predictions and test results exist, especially in the
three-dimensional simulations.

Thus, it appears necessary to include additional fracture mechanics models in FEINT to improve the accuracy of
the predictions. These fracture models should reflect the highly strain-rate, temperature, and time dependent mechanical
behavior of solid propellants. The inhomogeneity of the propellant, which may induce inelastic fracture process zones,
may have an essential influence on a motor's firing performance. The adopted fracture models should have the capability
to simulate this nonlinear behavior.

In this study, a nonlocal smeared cracking model [Bazant and Lin, 1988] was used to simulate the effect of the
fracture process zone. Using a finite element program equipped with the nonlocal smeared cracking model,
two-dimensional models studied by Thiokol [Graham, et al, 1995] under the Structural/Ballistic Risk Assessment
Methodology program were analyzed to see whether the nonlocal model can better predict Thiokol's test results.

Current  Fracture  Mechanics  Models  in  FEINT 

The  fracture  module  in  FEINT  consists  of  three  different  fracture  analysis  methods: 

1. Crack closure integral method - This method calculates the energy release rate G based on the work required
to reverse crack propagation by an amount ΔA:

$$G = -\frac{dP}{dA} = \frac{dW_c}{dA} \approx \frac{W_c}{\Delta A} \tag{1}$$

Here P is the total energy, as in the J-integral definition given under method 3. The work required to reverse the
crack, W_c, is the area under the force-displacement curve. If the material behavior is such that the
force-displacement path from the open crack state to the closed crack state is unique, as in a linear elastic
material, then the crack closure integral may be applicable.

2. Crack opening displacement method - This method computes the stress intensity factors from the crack tip
opening displacements [Anderson and Abrahamson, 1984], and is similar to that used by ANSYS. The method is
applicable to cracks in isotropic linear elastic materials only. Based on LEFM, the crack opening displacements Δv,
Δu, and Δw near the crack tip, where Δ indicates displacement differences between the two crack surfaces, are related to
the stress intensity factors K_I, K_II, and K_III by:

$$\Delta v = K_I \frac{\kappa + 1}{\mu} \sqrt{\frac{r}{2\pi}}, \qquad \Delta u = K_{II} \frac{\kappa + 1}{\mu} \sqrt{\frac{r}{2\pi}}, \qquad \Delta w = K_{III} \frac{4}{\mu} \sqrt{\frac{r}{2\pi}} \tag{2}$$

where r is the distance from the crack tip, μ is the shear modulus, κ equals (3 - ν)/(1 + ν) for plane stress problems
and equals (3 - 4ν) for plane strain problems, and ν is Poisson's ratio. Taking the natural log of both sides of the
equations and solving for K as r → 0 yields:

$$K_I = \lim_{r \to 0} \frac{\mu}{\kappa + 1} \sqrt{\frac{2\pi}{r}}\, \Delta v, \qquad K_{II} = \lim_{r \to 0} \frac{\mu}{\kappa + 1} \sqrt{\frac{2\pi}{r}}\, \Delta u, \qquad K_{III} = \lim_{r \to 0} \frac{\mu}{4} \sqrt{\frac{2\pi}{r}}\, \Delta w \tag{3}$$


3. J-integral method - This method is based on the virtual crack extension method proposed by Parks [1974, 1977],
and supports linear and nonlinear material behavior, including thermal strains and large deformation. The J-integral
method implemented in FEINT also accounts for distributed loads on the crack surfaces. The procedure assumes that
steady state conditions exist around the crack tip. This method can also calculate the energy release rate, stress intensity
factors, and the speed of crack propagation if the necessary properties have been defined. Internodal constraints are
ignored and should not be used inside the outermost contour of elements.

The J-integral is defined as the decrease in total energy, P, per unit area of crack extension, A, i.e.,

$$J = -\frac{dP}{dA} \tag{4}$$


Total potential energy is a function of material reference positions X, displacements u, and external loads f. For linear
elastic materials, it can be shown that the J-integral is equal to the energy release rate G. For a virtual crack extension,
δA, the above equation can be written in terms of virtual quantities as

$$J = -\left.\frac{\delta P}{\delta A}\right|_{u = \text{constant}} = \left.\frac{\delta W_e - \delta W_i}{\delta A}\right|_{u = \text{constant}} \tag{5}$$

where W_e and W_i are the work due to external and internal forces, respectively. Here, u = constant means that the
displacements u are held fixed during the virtual crack extension. This is permissible because the terms associated with
δu vanish: the finite element solution yields the minimum potential energy, i.e., δP/δu = 0.

The internal work is W_i = ∫_V U dV, where U is the strain energy density and V is the entire volume of the solid under
consideration. The variation of the internal work equation gives

$$\delta W_i = \int_V U\, \delta(dV) + \int_V \delta U\, dV \tag{6}$$


For linear elastic materials, U = ½ σε and δU = σ δε. For nonlinear materials, U depends on the material constitutive
relationship and loading history, and is generally available as a step-by-step accumulated result from the analysis
program. δU can be evaluated from a Taylor series expansion as δU = σ δε + ½ δσ δε. Since δσ is unknown, the
first order approximation δU = σ δε is used, where the Cauchy stress and the variation of the small strain induced by the
virtual crack extension are utilized in FEINT. It would be more accurate if the actual material model were used to determine δU.

Only loads in the vicinity of the crack will contribute to δW_e, because a virtual crack extension produces
displacement changes only in a small region near the crack tip. Restrict loads in the vicinity of the crack to normal
pressure on the crack surface, and assume no change in load magnitude during the virtual crack extension. Then, we
have

$$\delta W_e = \int_{S} p\, \mathbf{n} \cdot \mathbf{u}\, dS - \int_{S_0} p\, \mathbf{n} \cdot \mathbf{u}\, dS \tag{7}$$

where p is the normal pressure applied to the current deformed crack surface, S_0 is the deformed crack surface, S is
the deformed and extended crack surface, and n is the unit surface normal vector. Fig. 1 shows a virtual crack extension
by perturbing the crack tip node. Note that the energy method does not allow the separation of K_I, K_II, and K_III in mixed
mode situations, unless two of these fields are zero, i.e., a single mode fracture problem.
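
To make the definition of J as an energy derivative concrete, the following Python sketch evaluates J = -dP/dA by a finite difference in crack length for a double cantilever beam held at a fixed opening displacement, and compares it with the closed-form energy release rate from simple beam theory. This is an illustration only, not the FEINT implementation: the beam-theory compliance, the function names, and the property values are all assumptions made for the example.

```python
def dcb_strain_energy(a, delta, E, I):
    """Strain energy of a double cantilever beam held at total opening delta.

    Each arm is treated as a cantilever of length a (simple beam theory),
    so the specimen compliance is C = 2 a^3 / (3 E I).
    """
    compliance = 2.0 * a ** 3 / (3.0 * E * I)
    return delta ** 2 / (2.0 * compliance)

def j_virtual_extension(a, delta, E, I, width, da=1.0e-4):
    """J = -dP/dA at fixed displacement, by central difference in a.

    With the displacement held fixed the external loads do no work during
    the extension, so the change in total energy is the change in strain
    energy. The crack area is A = width * a.
    """
    p_plus = dcb_strain_energy(a + da, delta, E, I)
    p_minus = dcb_strain_energy(a - da, delta, E, I)
    return -(p_plus - p_minus) / (2.0 * da * width)

def g_closed_form(a, delta, E, I, width):
    """Beam-theory energy release rate, G = 9 E I delta^2 / (4 width a^4)."""
    return 9.0 * E * I * delta ** 2 / (4.0 * width * a ** 4)

# Hypothetical values in consistent units
E, width, arm_height = 1000.0, 1.0, 1.0
I = width * arm_height ** 3 / 12.0
a, delta = 10.0, 0.1
j_numeric = j_virtual_extension(a, delta, E, I, width)
g_exact = g_closed_form(a, delta, E, I, width)
```

For this linear elastic case the two values agree, which is the statement in the text that J equals G for linear elastic materials.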

In  FEINT,  three  crack  tip  mesh  types  can  be  generated  around  a  crack  tip:  focused  mesh,  rectangular  mesh,  and 
zipper  mesh  (Fig.  2).  The  rectangular  mesh  was  used  in  the  Thiokol  study  for  two-dimensional  models. 
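
Returning to the crack opening displacement method (method 2 above), the sketch below shows one common way to extract K_I from near-tip opening displacements: compute an apparent K_I at each distance r from the standard LEFM relation Δv = K_I (κ + 1)/μ · sqrt(r/2π), fit it linearly in r, and take the intercept at r = 0. The linear extrapolation is an assumption chosen for simplicity and is not necessarily the procedure used in FEINT; the function names and material values are hypothetical, and the input data are synthetic.

```python
import math

def kolosov_constant(nu, plane_strain=True):
    """kappa = 3 - 4*nu (plane strain) or (3 - nu)/(1 + nu) (plane stress)."""
    if plane_strain:
        return 3.0 - 4.0 * nu
    return (3.0 - nu) / (1.0 + nu)

def apparent_k1(r, dv, mu, kappa):
    """Apparent K_I at distance r from the opening displacement dv."""
    return mu / (kappa + 1.0) * math.sqrt(2.0 * math.pi / r) * dv

def extrapolate_k1(radii, openings, mu, kappa):
    """Fit apparent K_I linearly in r and return the intercept at r = 0."""
    ks = [apparent_k1(r, dv, mu, kappa) for r, dv in zip(radii, openings)]
    n = len(radii)
    r_mean = sum(radii) / n
    k_mean = sum(ks) / n
    sxy = sum((r - r_mean) * (k - k_mean) for r, k in zip(radii, ks))
    sxx = sum((r - r_mean) ** 2 for r in radii)
    return k_mean - (sxy / sxx) * r_mean

# Synthetic check: openings generated from a known K_I plus a term linear in r
mu, nu = 1.0e3, 0.499  # hypothetical, nearly incompressible material
kap = kolosov_constant(nu)
k_true = 2.0
radii = [0.01, 0.02, 0.03, 0.04]
openings = [(kap + 1.0) / mu * math.sqrt(r / (2.0 * math.pi)) * (k_true + 0.5 * r)
            for r in radii]
k_est = extrapolate_k1(radii, openings, mu, kap)
```

In a finite element setting, `radii` and `openings` would come from node pairs on the two crack faces behind the tip, which is one reason the choice of crack tip mesh matters.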

Fracture  Predictions  in  the  Subscale  Motor  Test  Program 

In Thiokol's study, the subscale motor program focused particularly on the effect of defects. The
motor was designed to encourage defect propagation during firing. The motor case, made of filament-wound nylon in
a low-modulus epoxy matrix, is highly flexible, and thus provides large strains in the propellant grain under ignition
pressurization to allow crack or debond propagation. The case can reach 10 percent hoop strain prior to failure at about
900 psi of hydroburst test pressure. Meanwhile, catastrophic failure is not desired, so the motor is provided with thick
internal insulation to protect the case from early flame exposure.

Numerical  predictions  using  the  FEINT  program  were  made  before  each  of  thirteen  static  firing  tests  of  subscale 
motors.  The  pressure  measurements  and  the  real-time  radiography  (RTR)  observations  provided  the  best  indication 
of  the  growth  of  defects.  A  baseline  motor  without  defects  was  first  tested  and  the  test  data  were  recorded.  For 
numerical  simulation,  a  coarse  finite  element  model  was  used  initially,  allowing  numerous  analyses  with  different 
material  properties  and  assumptions  in  order  to  match  the  test  measurements  as  closely  as  possible.  Then  a  fine  finite 
element  model  was  created  and  the  best  properties  and  analysis  assumptions  were  used  to  predict  the  motor  fracture 
behavior  and  to  find  the  fracture  parameters.  The  flow  charts  of  this  analysis  approach  are  shown  in  Fig.  3. 

The results of large deformation structural analyses using ABAQUS, based on the material properties determined
with the coarse mesh, were used to calculate the debond fracture energy at each step. A plot of debond fracture energy
versus time was obtained. This debond fracture energy history was then used to calibrate a debond growth speed model
using a modified power law, in which the crack speed is proportional to the difference between the current fracture
energy and the critical energy, raised to a power:

$$\dot{a} = A \left( J - J_c \right)^q \tag{8}$$


In this model, both the critical fracture energy, J_c, and the crack speed coefficient, A, are rate dependent. This rate
dependence is handled by making each of them a further power law function of the effective propellant modulus. The
coefficients obtained by fitting the finite element results, having an initial 0.1", to the baseline motor test data were:

J_c = 0.00015 , A = 1 × 10^(-[illegible]) , q = 0.83 (9)
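
The modified power law described above can be sketched in a few lines of Python. The threshold J_c = 0.00015 and exponent q = 0.83 are taken from the text; the speed coefficient used below is a placeholder, because the exponent of A is not legible in the source, and both J_c and A are held constant here even though the text states that they depend on the effective modulus. Units are left unspecified, and the function names are hypothetical.

```python
def debond_speed(j, j_c=0.00015, coeff=1.0e-3, q=0.83):
    """Crack or debond speed from the modified power law.

    speed = coeff * (j - j_c)**q when j exceeds the critical energy j_c,
    and zero otherwise. coeff is a placeholder value, not the fitted A.
    """
    excess = j - j_c
    if excess <= 0.0:
        return 0.0
    return coeff * excess ** q

def grow_crack(a0, j_history, dt, **law):
    """Explicit time stepping of crack length through a fracture-energy history."""
    a = a0
    for j in j_history:
        a += debond_speed(j, **law) * dt
    return a

# Example: a fracture energy that ramps through the threshold
history = [0.00005 * k for k in range(10)]
final_length = grow_crack(0.1, history, dt=0.01)
```

Because the speed is zero below J_c, a defect whose fracture energy never reaches the threshold is predicted not to grow, which is how the model separates critical from noncritical defects.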

Unexpectedly, the fracture parameters fit to the static motor firing test data are considerably different from those obtained
from laboratory specimen tests for a similar propellant material (Fig. 4). The critical fracture energy obtained from
the static firing is lower than the laboratory data by an order of magnitude. Thiokol commented that it was not known
whether the differences arose from the material itself or from the different configuration and loading of the
motor and a laboratory specimen. However, if the critical fracture energy is a material property, it should not
depend on specimen geometry or loading configuration. Additional work is needed to identify realistic fracture
properties of the propellant. The properties obtained from the first motor static firing test were used in the subsequent
numerical simulations.

Based on the test results and the failure analysis of the first motor, the motor was redesigned to include the addition
of flaps at both ends of the grain and an increase in the nozzle throat diameter. Fig. 5 shows the flow chart for the
subsequent analyses. The fracture energy calculated from the structural analysis was used to determine whether the debond
or crack would propagate and, based on the debond speed model, how fast. The coupled burnback/ballistics analysis
calculated the pressure history, which was applied as a load in the structural analysis. The latter in turn provided strains
as a function of the pressure, creating strain histories for computing time-dependent effective moduli for the propellant,
the insulator, and the case.

Table 1 shows the different defects and FEM models used in the subscale motor tests. Fig. 6 shows the subscale motor
configuration. For the motors without defects, the predictions agreed well with the test data, but for those with cracks
or debonds, they did not match well. Three predicted noncritical cracks and two predicted critical cracks were tested.
The criticality of bore cracks was predicted correctly, but crack propagation speeds were much overpredicted. Debond
propagation was predicted for all four axisymmetric debond motors, but was observed in none of them. A known
natural debond in one motor was predicted to propagate; it did not. However, a similar debond in another motor
propagated extensively. The differences between predictions and test measurements in tests 10, 11, and 12 are shown
in Fig. 7.

Fracture  Mechanics  Models  for  Solid  Rocket  Propellant  Motors 

A solid rocket propellant may be classified as a viscoelastic material. Its mechanical behavior is highly time,
strain-rate, and temperature dependent. A good fracture mechanics model should be able to capture these characteristics.
The conventional J-integral [Rice, 1968] will uniquely characterize the crack-tip conditions in a viscoelastic material
only for a given failure time. This means a critical J value, J_c, taken from a laboratory test is relevant to a structure
only if they have similar failure times. Thus, it is difficult to apply such data to structural components. To account for
the time dependence of viscoelastic materials, Schapery [1984] proposed a characterizing parameter, J_v, and showed
that J_v uniquely defines the crack-tip conditions, making it a suitable fracture criterion for a wide range of rate-sensitive
materials. By limiting the dependence of the time-shift factor on the stress or by choosing specific power-law forms
for the constitutive response, Schapery [1990] further showed that J_v is path independent for nonlinearly viscoelastic
materials with constitutive equations based on a reduced time, and is a candidate parameter for the characterization of
the crack-tip fields. Bencher and his co-workers [1995] measured the fracture toughness of H-24 propellant at three
different strain rates and three selected temperatures using Schapery's time-dependent viscoelastic approach. They
concluded that J_v appears to provide an appropriate measure of the toughness of that propellant.

A J-integral value can be computed as the sum of its elastic and plastic components from measurements of the applied load
and load-point displacement using [Anderson, 1991]

$$J = \frac{K_I^2}{E'} + \frac{\eta}{b\, h} \int_0^{\Delta_{pl}} P\, d\Delta_{pl} \tag{10}$$

where E' equals E or E/(1 - ν²) in plane stress or plane strain problems, respectively, h is the uncracked ligament length,
b is the specimen thickness, P is the applied load, Δ is the load-point displacement (Δ_pl denotes its plastic part), and
η is a geometry-dependent factor.

The viscoelastic integral can be calculated from the same formula by integrating a load vs. pseudoelastic
displacement (P - Δ^e) curve rather than a load vs. load-point displacement (P - Δ) curve. The P - Δ^e curve is converted
from a P - Δ curve by removing the creep displacements arising from the material's viscous behavior that occur during
the fracture test. This was achieved by conducting a stress relaxation test at each of the relevant temperatures, and
dividing the measured stress by the constant strain to calculate the effective Young's modulus as a function of time, E(t).
Then, the pseudoelastic displacement for a constant-displacement-rate fracture test is given by

$$\Delta^e = \Delta\, \frac{E(t)}{E_R} \tag{11}$$

where the reference modulus E_R is defined as the short-time relaxation modulus taken at time = 0, E(t) is independent
of the strain applied in the stress relaxation test, and the P - Δ^e curve after removing the relaxation strains will be linear.
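
The conversion from measured to pseudoelastic displacement can be sketched as follows. The power-law form of the relaxation modulus, its parameters, and the function names are assumptions introduced for illustration; in practice E(t) would be taken from the stress relaxation tests described above. The small helper at the end computes the area under a load-displacement curve by the trapezoidal rule, which is the quantity that enters the plastic part of J.

```python
def relaxation_modulus(t, e_ref=1000.0, t0=1.0e-3, n=0.2):
    """Hypothetical power-law relaxation modulus.

    Equals the reference (short-time) modulus e_ref for t <= t0 and decays
    as (t / t0)**(-n) afterwards.
    """
    if t <= t0:
        return e_ref
    return e_ref * (t / t0) ** (-n)

def pseudoelastic_displacement(delta, t, e_ref=1000.0, **kw):
    """Pseudoelastic displacement: delta scaled by E(t) / E_R."""
    return delta * relaxation_modulus(t, e_ref=e_ref, **kw) / e_ref

def area_under_curve(loads, displacements):
    """Trapezoidal area under a load versus displacement record."""
    area = 0.0
    for i in range(1, len(loads)):
        mean_load = 0.5 * (loads[i] + loads[i - 1])
        area += mean_load * (displacements[i] - displacements[i - 1])
    return area

# Constant-displacement-rate record (made-up numbers): delta = rate * t
rate = 10.0
times = [0.0, 0.01, 0.02, 0.03, 0.04]
deltas = [rate * t for t in times]
pseudo = [pseudoelastic_displacement(d, t) for d, t in zip(deltas, times)]
```

Since E(t) never exceeds E_R in this model, each pseudoelastic displacement is no larger than the measured one, reflecting the removal of the creep contribution.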

Another critical aspect is the influence of the inelastic fracture zone. Linear elastic stress analysis indicates that
the tensile stress at a sharp crack tip approaches infinity. Since no material can withstand infinitely large stress, an inelastic
zone must exist in front of the crack tip. In the inelastic zone, some fracture process, such as microcracking or
microvoid formation, occurs. Thus, the zone is also called the fracture process zone. For homogeneous materials,
because the size of the fracture zone is negligible compared to the overall dimensions of the member examined, the
effect of the zone is usually ignored, and thus linear elastic fracture mechanics (LEFM) can be approximately applied.
However, this nonlinearity localized at the crack tip is usually more prominent in inhomogeneous materials, since the
fracture process zone spreads to a relatively large size due to the heterogeneous nature of the materials. Solid rocket
propellants possess a microstructure dictated by agglomerates of filler particles embedded in a polymer matrix, and are
considered inhomogeneous materials. In order to better simulate the fracture behavior of solid rocket propellant
motors, consideration of the fracture process zone is necessary. While LEFM requires only one fracture parameter to
characterize the crack tip behavior, more than one parameter is required to account for the influence of the fracture
process zone. Several models have been proposed:

1. Fictitious crack model - An early attempt to model the behavior in the fracture process zone was presented by
Dugdale and is considered to be an extension of the linear elastic behavior to ductile materials. Because of its
assumption of elastic-perfectly plastic behavior, this model cannot accurately represent damage in inhomogeneous
composites, since it fails to account for factors such as microcracking and particle pullout. Hillerborg [1976] proposed
a fictitious crack model, which is considered more suitable for inhomogeneous materials. The model is based on
applying a cohesive stress on the crack surface near the crack tip that decreases with increasing crack opening
displacement, rather than remaining constant. The fracture parameters of this model include the tensile
strength, f_t, the crack opening displacement beyond which the cohesive stress vanishes, w_c, and the energy release rate,
G_F. G_F is defined as the area under the cohesive stress versus crack opening displacement (σ - w) curve, i.e.,

$$G_F = \int_0^{w_c} \sigma(w)\, dw \tag{12}$$

Different σ - w relations have since been introduced, such as linear, bilinear, and exponential decay. The relation varies with
the material and has to be determined experimentally.
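
As a short illustration of the definition of the fracture energy as the area under the cohesive stress versus crack opening curve, the Python sketch below integrates two softening relations numerically. The function names and parameter values are hypothetical. Note that the exponential curve has no finite opening at which the stress vanishes, so it is integrated out to a large multiple of its decay length instead.

```python
import math

def linear_softening(w, f_t, w_c):
    """Cohesive stress falling linearly from f_t at w = 0 to zero at w = w_c."""
    return f_t * (1.0 - w / w_c) if w < w_c else 0.0

def exponential_softening(w, f_t, w_ref):
    """Cohesive stress decaying exponentially with decay length w_ref."""
    return f_t * math.exp(-w / w_ref)

def fracture_energy(sigma, w_max, steps=10000):
    """Trapezoidal integration of sigma(w) from 0 to w_max."""
    dw = w_max / steps
    total = 0.5 * (sigma(0.0) + sigma(w_max))
    for i in range(1, steps):
        total += sigma(i * dw)
    return total * dw

# Hypothetical parameters in consistent units
f_t, w_c, w_ref = 3.0, 0.02, 0.01
g_linear = fracture_energy(lambda w: linear_softening(w, f_t, w_c), w_c)
g_exponential = fracture_energy(lambda w: exponential_softening(w, f_t, w_ref),
                                50.0 * w_ref)
```

For the linear law the result reduces to one half of f_t times w_c, and for the exponential law to f_t times w_ref, so both curves in this example carry the same fracture energy despite their different shapes.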


2. Size-effect model - For members that follow the strength criterion or yield criterion, the log-log plot of peak loads
versus member sizes is a horizontal line, i.e., there is no size effect. Linear elastic fracture mechanics
yields the strongest possible size effect, which corresponds to a straight inclined line of slope -1/2. For materials that
exhibit distributed cracking (a fracture process zone), the size effect plot shows a smooth transition from the
horizontal line for the strength criterion to the inclined straight line for linear elastic fracture mechanics [Bazant and
Pfeiffer, 1987]. Since test results vary with member size due to the size effect, it is difficult to find material fracture
parameters. Thus, the size-effect model extrapolates the experimental data to those for the infinite specimen and, based
on that, defines two fracture parameters, G_f and c_f, the critical energy release rate and the critical effective crack extension
for the infinite specimen, respectively [Bazant and Kazemi, 1990]. To determine these two fracture parameters, at least
three different sizes of geometrically similar specimens have to be tested. Bazant and Lin [1988] analyzed fracture tests
on specimens of various sizes by finite elements using a nonlocal smeared cracking model. Their numerical results
agreed well with the size-effect law.

3. Two-parameter fracture model - As the load applied to a specimen increases from zero to its maximum value,
the crack length changes from $a_0$ to $a_c$, where $a_0$ is the initial crack length and $a_c$ is the critical effective
crack length at the peak load. LEFM can then be used to present a criterion for failure:

$$K_{Ic} = \sigma_{Nc} \sqrt{\pi a_c}\, F(a_c) \qquad (13)$$

where $\sigma_{Nc}$ is the nominal failure stress (the peak value of the nominal stress), $K_{Ic}$ is the critical stress
intensity factor for the specimen, and $F$ is a geometry-dependent function. It has been shown that the value of $K_{Ic}$
remains constant for specimens of the same material. However, the value of $a_c$ depends on specimen size and geometry,
and cannot be used as a structure-independent material parameter. Jenq and Shah [1985] found that when structures of the
same material fail, values of the elastic part of the crack tip opening displacement are approximately constant,
regardless of the size of the structure. Based on this observation, they proposed the two-parameter fracture model, and
claimed that the criterion for failure of a structure can be expressed as simultaneous satisfaction of

$$K_I(\sigma_{Nc}, a_c) = K_{Ic}, \qquad CTOD(\sigma_{Nc}, a_c) = CTOD_c \qquad (14)$$

where CTOD represents the elastic crack tip opening displacement at the location of the initial crack tip. Functions for
$K_I$ and CTOD on the left side of the above two equations in terms of applied load and crack length are based on LEFM
and can be found in LEFM manuals for various structure geometries. Both $K_{Ic}$ and $CTOD_c$ are regarded as material
fracture parameters that are independent of structural size and geometry. The value of the critical length $a_c$ is
determined by equating the unloading compliance of applied load versus crack mouth opening displacement at the peak load
to the initial compliance, so its value lies somewhere between the actual crack length and the combined length of the
actual crack and the fracture process zone.
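
Returning to the fictitious crack model (item 1), the fracture energy of Eq. 12 is simply the area under the softening curve, and a short numerical sketch makes the dependence on the chosen curve concrete. The strength, the critical opening, the kink of the bilinear law, and the decay length of the exponential law below are hypothetical values chosen only for illustration; they are not propellant data from this study.

```python
import math

def fracture_energy(sigma, w_c, n=10_000):
    """Trapezoidal estimate of G_F = integral of sigma(w) from 0 to w_c."""
    h = w_c / n
    total = 0.5 * (sigma(0.0) + sigma(w_c))
    for i in range(1, n):
        total += sigma(i * h)
    return total * h

F_T = 100.0  # tensile strength, psi (hypothetical value)
W_C = 0.02   # opening at which the cohesive stress vanishes, in. (hypothetical)

def linear(w):
    """Linear softening from F_T at w = 0 to zero at w = W_C."""
    return F_T * (1.0 - w / W_C)

def bilinear(w, s1=30.0, w1=0.005):
    """Two straight segments joined at the (hypothetical) kink point (w1, s1)."""
    if w <= w1:
        return F_T + (s1 - F_T) * w / w1
    return s1 * (W_C - w) / (W_C - w1)

def exponential(w, w0=0.004):
    """Exponential decay; the integration limit W_C truncates the tail."""
    return F_T * math.exp(-w / w0)

for name, law in (("linear", linear), ("bilinear", bilinear),
                  ("exponential", exponential)):
    print(f"{name:12s} G_F = {fracture_energy(law, W_C):.4f} psi-in.")
```

For the linear law the result reduces to the triangle area, one half of the tensile strength times the critical opening, which gives a quick check on any experimentally fitted curve.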


Nonlocal  Smeared  Cracking  Model 

In this study, the author used a nonlocal smeared cracking model [Bazant and Lin, 1988] to simulate the effect of the
fracture process zone. The cracking is modeled by adjustments of material stiffness. This approach is called smeared
cracking. To avoid spurious mesh sensitivity and convergence to an incorrect failure mode with zero energy
dissipation, which is typical of the classical smeared cracking model, several alternatives have been proposed. One of
them is the coupling of the crack band model [Bazant and Oh, 1983] with a nonlocal continuum approach. The idea
is that the nonlocal treatment should be applied only to those variables which cause strain softening, while the other
variables, especially the elastic strain, should be local. In this form of nonlocal approach, the differential equations
of equilibrium with the boundary conditions retain their standard form, no extra boundary conditions need to be
introduced, the continuity requirements for finite elements remain the same as for the local formulation, and spurious
zero-energy periodic instability modes, which cause problems for the original nonlocal formulation, do not exist. Also,
the method produces numerical results in agreement with the size-effect law.

In two-dimensional cases, the crack band model may be characterized by a stress-strain relation (Eq. 15)
in which $\varepsilon^f$ is the fracturing strain; $\varepsilon_{11}$, $\varepsilon_{22}$, and $\gamma_{12}$ are the normal
strains and the shear angle; $\sigma_{11}$, $\sigma_{22}$, and $\sigma_{12}$ are the normal and shear stresses; $C_{ijkl}$
are the initial elastic compliances; and the numerical subscripts refer to the cartesian coordinates $x_1$ and $x_2$.
Assuming isotropy, we have $C_{1111} = C_{2222} = 1/E'$, $C_{1122} = C_{2211} = -\nu'/E'$, $C_{1212} = 2(1+\nu)/E$, in
which for plane stress $E' = E$, $\nu' = \nu$, and for plane strain $E' = E/(1 - \nu^2)$, $\nu' = \nu/(1 - \nu)$.

The cracks are assumed to be parallel and normal to the axis $x_1$. We introduce the classical hypothesis that, at the
time the cracks start to form, the direction of the cracks is fixed as normal to the maximum principal stress $\sigma_1$
at that instant and remains constant afterwards. If $\sigma_1$ does not rotate, $\sigma_{12}$ remains zero; however, in
general $\sigma_1$ may rotate, and in that case one must take into account the shear stiffness due to aggregate interlock
on rough crack surfaces. This is done by multiplying the shear compliance $C_{1212}$ with an empirical coefficient
$\beta$, called the shear retention factor.

It has been shown [Bazant and Oh, 1983] that Eq. 15 is equivalent to a relation (Eq. 16)
in which the normal stiffness $C_{1111}^{-1}$ in the direction normal to the cracks is reduced by the coefficient
$(1 - \omega)$, where $\omega$ represents what is known in continuum damage mechanics as damage and may be regarded as the
cracked area fraction; $\omega = 0$ corresponds to the initial state of no cracking (zero damage); and $\omega = 1$
corresponds to the final state of complete cracking (complete damage). Comparing Eqs. 15 and 16, we note that
$\varepsilon^f = [\omega C_{1111} / (1 - \omega)]\, \sigma_{11}$. Diverse assumptions may be adopted for softening and
unloading, such as secant unloading and linear or exponential strain softening.
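
The link between the fracturing strain and the damage can be checked with a short worked derivation. The two starting relations below are an assumed component form of Eqs. 15 and 16 for the strain normal to the cracks, written only with the symbols defined in the text (compliances $C_{1111}$ and $C_{1122}$, damage $\omega$, fracturing strain $\varepsilon^f$); the shear terms and the shear retention factor are left out because they do not enter this comparison.

```latex
\begin{align}
\varepsilon_{11} &= C_{1111}\,\sigma_{11} + C_{1122}\,\sigma_{22} + \varepsilon^{f}
  && \text{(additive fracturing strain)} \\
\varepsilon_{11} &= \frac{C_{1111}}{1-\omega}\,\sigma_{11} + C_{1122}\,\sigma_{22}
  && \text{(damaged normal compliance)} \\
\varepsilon^{f} &= \left(\frac{1}{1-\omega} - 1\right) C_{1111}\,\sigma_{11}
  = \frac{\omega\, C_{1111}}{1-\omega}\,\sigma_{11}
  && \text{(difference of the two)}
\end{align}
```

The limits behave as expected: with no damage the fracturing strain vanishes, and as the damage approaches one the fracturing strain grows without bound for any nonzero normal stress.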

The nonlocal continuum is a continuum for which at least some variables are defined by spatial averaging. Only
those variables that cause strain softening should be considered as nonlocal. This means that the damage should be
made nonlocal. This is accomplished by specifying it as a function of the average (nonlocal) strain, which is defined
by spatial averaging as follows:

$$\bar{\varepsilon}_{11}(x) = \frac{1}{V_r(x)} \int_V \alpha(s - x)\, \langle \varepsilon_{11}(s) \rangle \, dV = \int_V \alpha'(x, s)\, \langle \varepsilon_{11}(s) \rangle \, dV \qquad (17)$$

in which

$$V_r(x) = \int_V \alpha(s - x)\, dV, \qquad \alpha'(x, s) = \frac{\alpha(s - x)}{V_r(x)} \qquad (18)$$

The overbar is a label for the nonlocal variables, $V$ = volume of the entire body, $\alpha(x)$ = given weighting
function, which is treated as a material property, and $s$ = general coordinate vector. $V_r$ has approximately but not
exactly the same meaning as the representative volume in the statistical theory of heterogeneous materials. The pointed
brackets $\langle\ \rangle$ denote the positive part of the variable.

Calculations converge better if the weighting function is smooth. A suitable choice is the normal (Gaussian)
distribution:

$$\alpha(x) = e^{-(k |x| / l)^2} \qquad (19)$$

in which $k$ is a constant whose value is $k = 2$ for the case of two dimensions; $|x|^2 = x_1^2 + x_2^2$; and $l$ is the
characteristic length of the material, which may be interpreted as the diameter of a circle on which the uniform
distribution $\alpha = 1$ has the same volume as the normal distribution function extending to infinity in the plane
(with $k = 2$, the integral of $\alpha$ over the plane is $\pi l^2 / 4$).

If the body is finite, the normal distribution function obviously extends beyond the boundary. This is handled by
deleting the region outside the body from the integration domain $V$, both for the calculation of the average (Eq. 17) and
for the calculation of the representative volume (Eq. 18). This causes $V_r$ to depend on the location $x$.

In finite element calculations, the integrals in Eqs. 17 and 18 are approximately evaluated as finite sums over all
the integration points of all the elements in the structure. However, the integration points whose distances from the
point $x$ exceed approximately $2l$ may be omitted since for them the value of $\alpha$ is negligible.
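
The finite-sum evaluation of Eqs. 17 and 18 can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the implementation used in this study: the function names, the square grid of integration points, and the equal tributary areas in the usage note are all hypothetical, and the weight is assumed to have the Gaussian form with $k = 2$ described above.

```python
import math

def gauss_weight(dist, l, k=2.0):
    """Assumed Gaussian weight exp(-(k*dist/l)**2), with k = 2 in two dimensions."""
    return math.exp(-((k * dist / l) ** 2))

def nonlocal_average(points, strains, volumes, l, cutoff=2.0):
    """Nonlocal normal strain at every integration point (finite-sum form).

    points  : list of (x, y) integration point coordinates
    strains : local normal strain at each point
    volumes : tributary area (volume per unit thickness) of each point
    l       : characteristic length of the material
    cutoff  : points farther than cutoff*l are skipped (weight is negligible)
    """
    averaged = []
    for xi, yi in points:
        weighted_sum = 0.0
        v_r = 0.0  # representative volume; depends on location near boundaries
        for (xj, yj), eps, dv in zip(points, strains, volumes):
            d = math.hypot(xj - xi, yj - yi)
            if d > cutoff * l:
                continue
            a = gauss_weight(d, l)
            weighted_sum += a * max(eps, 0.0) * dv  # positive part of the strain
            v_r += a * dv
        averaged.append(weighted_sum / v_r)
    return averaged
```

Because only points inside the body contribute to both sums, the normalization stays consistent near a boundary: a uniform positive strain field is returned unchanged everywhere. For example, with `points = [(0.1 * i, 0.1 * j) for i in range(11) for j in range(11)]`, equal tributary areas, and `l = 0.5`, a single strained point at the center is spread over its neighbors. At the cutoff distance of $2l$ the weight is $e^{-16}$, about $10^{-7}$, which is why those points can be skipped.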

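The nonlocal damage is considered a function of the nonlocal normal strain:

$$(1 - \omega)^{-1} = \phi(\bar{\varepsilon}_{11}) \qquad (20)$$

Thus, the nonlocal constitutive equation with local strain takes the form of Eq. 16 with the damage $\omega$ evaluated
from Eq. 20.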

Two-dimensional Finite Element Modeling

A two-dimensional model studied by Thiokol was analyzed using a nonlocal smeared cracking model. The model was
analyzed assuming plane strain conditions. The finite element mesh is shown in Fig. 8, and the detail of the mesh around
the crack tip is shown in Fig. 9. The model includes a 0.3" deep bore crack and has 1386 4-node quadrilateral elements
with 1501 nodes. Transitional quadrilateral elements were used in the modeling (Fig. 9). The material properties used
in the analysis are as follows. For the propellant, the effective Young’s modulus is 800 psi and Poisson’s ratio is 0.499.
For the insulator, Young’s modulus is 1000 psi and Poisson’s ratio is 0.499. For the motor case, the effective Young’s
modulus is 375,000 psi and Poisson’s ratio is 0.4. The characteristic length, $l$, of the propellant obtained from the
best fit of the test results is 0.5". Isotropic conditions were assumed in this study.

To simulate the burning of the propellant, the interior bore diameter is increased in a piecewise manner every two
seconds, similar to the increments used in Thiokol’s study (Fig. 10). Figs. 11 and 12 show the size of the fracture
process zone obtained from the nonlocal finite element analysis at internal bore pressures of 160 psi and 300 psi,
respectively. Fig. 13 shows the comparison of internal pressure between test results and nonlocal simulations. Fig. 14
shows the comparison between Thiokol’s predictions and nonlocal simulations. The comparison of hoop strain is shown in
Fig. 15.

Discussion  and  Conclusions 

In order to achieve a successful numerical simulation of the firing performance of a solid propellant rocket motor, it
is essential to have adequate constitutive relations for the materials that form the motor (propellant, insulator,
adhesive liner, and case). A linear elastic model is the simplest one, but cannot achieve a good prediction. More
elaborate models may include plastic effects, viscoelasticity, and viscoplasticity. Temperature-dependent material
properties and rate effects should also be included to obtain a better prediction. Cracks, voids, and debonds (adhesive
cracks) compound the difficulty of the modeling. Their existence alters the ballistic performance by exposing a larger
burning surface area. Proper fracture criteria have to be sought to determine when a crack will propagate and, if so, in
which direction and how far it will go. The modeling of debond or adhesive crack propagation is even more
complex because it involves two different materials at the crack tip.

Schapery’s $J_v$ integral was developed specifically for viscoelastic materials. It is suggested that this
integral be incorporated into FEINT as a fracture toughness measure to predict whether a crack will propagate. To include
the effect of the fracture process zone, we may consider the nonlocal smeared cracking approach, Hillerborg’s fictitious
crack model, or Jenq and Shah’s two-parameter model. In Thiokol’s subscale motor test program, the crack propagation
speeds were generally overpredicted. Liu and Ravi-Chandar [1996] showed that a power law relationship exists between the
mode I stress intensity factor and the crack growth speed. It was also observed in Thiokol’s test program that some motor
failures were caused by splitting of the insulator at the flap tip, rather than by crack propagation in the propellant,
allowing the flame to burn through the aft dome. It is suggested that a future numerical analysis should take into
account the effect of the sharp tip caused by making a flap in the insulator.

The  nonlocal  smeared  cracking  model  has  been  used  successfully  for  cementitious  materials.  This  model  could 
be  applied  to  the  simulation  of  the  firing  of  a  solid  propellant  rocket  motor.  The  numerical  simulations  of  this  study 
show  fair  agreement  with  the  experimental  results.  To  improve  the  fracture  behavior  predictions,  we  may  incorporate 
the  nonlocal  model  in  the  FEINT  program.  In  this  way,  we  can  take  advantage  of  all  the  modules  of  FEINT  to  take 
into  account  various  aspects  of  the  complex  motor  firing  process.  Also,  it  is  essential  to  obtain  realistic  fracture 
parameters  experimentally  in  order  to  further  improve  the  numerical  simulation. 

Table 1: Subscale Motor Tests

Test No.  Test Date  Description                                Nozzle Throat Dia.  FEM Model
1         5/17/93    Original baseline, no flaps, no flaws      2.0 in.             Axisymmetric
2         7/13/94    Redesign baseline, with flaps, no flaws    2.3 in.             Axisymmetric
3         7/14/94    Full-length bore crack, 0.3 in. deep       2.3 in.             Gen. Plane Strain
4         7/19/94    Full-length bore crack, 0.35 in. deep      2.0 in.             Gen. Plane Strain
5         3/13/95    Buried debond, axisym., 0.75 in. wide      2.3 in.             Axisymmetric
6         3/14/95    Buried debond, axisym., 0.75 in. wide      2.3 in.             Axisymmetric
7         3/15/95    Voids, 0.72 x 0.47 x 0.30 in. and smaller  2.3 in.             3-D
8         3/16/95    Edge debond, axisymmetric, 1.0 in. deep    2.3 in.             Axisymmetric
9         3/17/95    Edge debond, axisymmetric, 1.0 in. deep    2.3 in.             Axisymmetric
10        3/17/95    Half-circle plane bore crack, r = 0.4 in.  2.1 in.             3-D
11        3/20/95    Half-circle plane bore crack, r = 0.6 in.  2.1 in.             3-D
12        3/22/95    Half-circle plane bore crack, r = 1.7 in.  2.1 in.             3-D
13        3/27/95    0.3-in. axisym. edge and natural debonds   2.3 in.             3-D

Fig. 2 Three Crack Tip Mesh Types Available in FEINT: (a) Focused Mesh, (b) Rectangular Mesh, and (c) Zipper Mesh.

Fig. 3 Numerical Analysis Flow for Baseline Motor Analysis: (a) Coarse Model Analysis Flow, (b) Fine Model Analysis Flow.

Fig. 4 Fracture Parameters Obtained from Motor Firing Tests and Laboratory Specimen Tests: (a) Critical J-integral
Value versus Effective Modulus, (b) Debond Speed Parameter A versus Effective Modulus.

Fig. 5 Numerical Analysis Flow for Redesigned Motor Analysis.

Fig. 6 Subscale Motor Configuration.

Fig. 7 Numerical Prediction and Test Measurement for Subscale Motor Test 10 11 and

Fig. 8 Finite Element Model (Half Circle with 0.3-inch crack is shown)

Fig. 9 Crack Tip Model Used in this Study.

Fig. 10 Sequence of Models for Motor Analysis

Fig. 11 Fracture Process Zone at an Internal Pressure of 160 psi (2 seconds)

Fig. 12 Fracture Process Zone at an Internal Pressure of 300 psi (4 seconds)

Fig. 14 Comparison Between 2-D Thiokol’s Predictions and Nonlocal Numerical Simulation.

Fig. 15 Comparison of Hoop Strain Among Test Results, Thiokol’s Predictions, and Nonlocal Numerical Simulation


CROSS SECTIONAL AREA ESTIMATION OF ORBITAL DEBRIS


Ronald  A.  Madler 
Assistant  Professor 
Department  of  Aerospace  Engineering 
Embry-Riddle  Aeronautical  University 


Abstract 


The  physical  characteristics  of  breakup  debris  are  essential  to  properly  estimate  the  orbital  debris 
environment  and  its  hazard  to  spacecraft.  This  paper  examines  methods  of  determining  certain  physical 
characteristics  utilizing  ground  based  fragmentation  debris.  A  well-defined  and  easily  repeatable  method  of 
determining  the  cross  sectional  area  of  ground  based  fragmentation  debris  has  been  defined.  This  method  expands 
on  previous  methods  of  obtaining  and  modeling  physical  characteristics  of  debris.  The  results  of  these  measurement 
methods  are  compared.  This  report  presents  methods  and  results  of  direct  measurements  of  the  cross  sectional  area 
of  fragmentation  debris. 

Introduction 

There  exists  a  need  to  quantify  and  characterize  the  hazard  posed  to  orbiting  assets  by  space  debris.  Currently  there 
are approximately 9,500 objects being tracked by the United States Space Command’s (USSPACECOM) Space
Surveillance Network (SSN). The majority of these tracked objects are operational and fragmentation debris. The
tracked objects in orbit as of June 1996 are categorized as follows: satellite fragmentation debris (39%); inactive
payloads (24%); rocket bodies (18%); operational debris (11%); active payloads (6%); and anomalous event debris
(2%).'  A  much  larger  population  of  objects  below  the  detection  threshold  is  known  to  exist  from  the  approximately 
150  on-orbit  satellite  breakups  that  have  occurred  to  date.  Although  not  trackable,  they  still  pose  a  threat  to  space 
operations.  Hence,  it  is  important  to  be  able  to  model  the  fragments  from  these  on-orbit  fragmentation  events. 

The  relationships  between  mass,  area,  and  size  are  important  because  these  values  affect  the  decay  of  resident  space 
objects  (RSOs),  the  expected  collision  rate,  the  hazard  to  other  RSOs,  and  the  observability  of  the  debris.  The 
relationships  between  mass,  number,  and  size  have  been  examined  extensively.^'*’  Cross  sectional  area  for  fragments 
is  important  to  model  fragment  behavior,  but  is  difficult  to  determine.  Several  researchers  have  examined 
observations  of  on-orbit  fragmentation  debris,  but  great  uncertainties  exist  in  determining  area  of  these  objects  due 
to  the  long  distance  to  the  objects  and  the  many  unknowns.’’'® 

This  report  examines  a  method  to  determine  the  cross  sectional  area  of  debris  fragments  at  a  large  number  of 
orientations, from which an average value can be determined. The method has been used on various shapes of
known  area.  The  cross  sectional  areas  of  some  of  the  fragments  also  have  been  measured  with  other  methods  to 
verify  the  results. 

Methodology 

A  set  of  simulated  debris  was  obtained  for  this  project  from  a  hypervelocity  impact  test.  Shot  CU-6470,  a 
hypervelocity  impact  conducted  at  the  Arnold  Engineering  Development  Center  (AEDC),  was  examined  in 
cooperation  with  the  University  of  Colorado  and  the  owners  of  the  fragments,  NASA/JSC.  The  physical  parameters 
of  size,  mass,  and  area  were  determined  for  these  fragments.  Also,  some  simple  shapes  were  obtained  and  used, 
including a sphere, cube, slender rod, cylinder, and flat plate.

Previous characterization of debris object size has focused on a piece’s characteristic dimensions. When obtaining
three characteristic dimensions, the first dimension is the longest dimension of an object, the second dimension is
the longest dimension orthogonal to the first, and the third dimension is the longest dimension orthogonal to the first
two. The first two dimensions are demonstrated in Figure 1. The average of these three dimensions is the
characteristic dimension and is usually referred to as the debris diameter.
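
The three-dimension rule can be illustrated on a set of surface points. The sketch below is one possible reading of the rule, not a procedure taken from this report: it assumes the fragment is represented by a hypothetical list of 3-D points, takes the longest chord as the first dimension, the longest chord of the points projected onto the plane orthogonal to it as the second, and the extent along the remaining orthogonal direction as the third.

```python
import math
from itertools import combinations

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _norm(a):
    return math.sqrt(_dot(a, a))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _longest_chord(points):
    """Return (length, unit direction) of the longest segment between two points."""
    p, q = max(combinations(points, 2), key=lambda pq: _norm(_sub(pq[0], pq[1])))
    d = _sub(p, q)
    length = _norm(d)
    return length, tuple(c / length for c in d)

def characteristic_dimensions(points):
    """Return (d1, d2, d3, average) for a non-degenerate 3-D point set."""
    d1, u1 = _longest_chord(points)
    # Remove the component along u1 so only the part orthogonal to it remains.
    flat = [_sub(p, tuple(_dot(p, u1) * c for c in u1)) for p in points]
    d2, u2 = _longest_chord(flat)
    u3 = _cross(u1, u2)                     # orthogonal to the first two
    along_u3 = [_dot(p, u3) for p in points]
    d3 = max(along_u3) - min(along_u3)
    return d1, d2, d3, (d1 + d2 + d3) / 3.0

# Hypothetical fragment: extreme points of a 3 x 2 x 1 shape along its axes.
fragment = [(1.5, 0, 0), (-1.5, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 0.5), (0, 0, -0.5)]
print(characteristic_dimensions(fragment))
```

The sketch fails for degenerate inputs such as points lying on a single line, where the second direction is undefined; a practical version would need to handle that case.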

Previous methods of determining the area in this research used a shadowgraph technique and a
planimeter."  '-  The  shadowgraph  method  relates  light  intensities,  with  and  without  a  fragment,  to  determine  the 
cross  sectional  area  seen  at  various  orientations.  Although  this  method  resulted  in  useful  data,  it  is  time  consuming 
and  difficult  to  repeat  as  it  requires  the  use  of  a  special  dark  room  and  light  sensing  equipment.  The  planimeter  is  a 
mechanical  device  that  determines  area  by  tracing  the  outline  of  an  object.  A  variety  of  random  images  containing  a 
reference  of  known  area  and  debris  are  taken  and  printed  out.  The  planimeter  is  then  used  to  trace  the  debris  and 
reference.  Comparing  these  values  gives  a  cross  sectional  area.  This  device  produces  useful  results  when  several 
pictures  of  a  fragment  are  analyzed;  however,  it  is  extremely  time  consuming. 


Figure 1: Characteristic dimension measurement

By  determining  the  cross  sectional  area  of  a  fragment  at  many  orientations,  an  average  area  and  normalized  relative 
area  frequency  (NRAF)  distribution  can  be  directly  computed.  NRAF  distributions  will  be  discussed  in  more  detail 
in  the  results  section  of  this  report.  This  process  can  be  carried  out  using  a  digital  image  method  (DIM)  wherein 
digital  images  of  a  debris  piece  are  obtained  at  many  orientations.  These  images  can  then  be  run  through  a  digital 
image  analysis  software  package  to  compute  the  average  area  and  area  distribution  quickly  and  accurately. 

Image Acquisition

The  setup  for  image  acquisition  is  crucial  to  the  speed  and  accuracy  of  the  DIM.  A  uniform  background  and 
lighting  must  be  set  in  place  before  images  are  taken.  The  background  color  of  choice  is  a  flat  black.  The  reasoning 
behind  this  choice  of  background  is  that  black,  by  its  nature,  will  absorb  all  or  nearly  all  light  frequencies  (color). 
In  this  way,  the  light  reflected  from  the  debris  piece  and  reference  will  best  stand  out  for  image  analysis.  An 
attempt  at  creating  a  uniform  background  (few  lines  or  creases)  will  also  increase  the  speed  and  accuracy  of  the 
DIM  by  making  the  edge  detection  easier.  A  good  uniform  lighting  source  is  also  important  as  shadows  can  also 
make  the  debris  outline  difficult  to  follow.  Two  overhead  fluorescent  lights  are  used  along  with  two  high  wattage 
halogen  lights.  A  smaller  mobile  halogen  is  placed  on  the  floor  for  bottom  lighting. 

A  reference  of  known  cross  sectional  area  must  be  placed  near  the  debris  piece.  The  prime  choice  is  a  white  sphere, 
as  it  will  reflect  the  most  light  and  has  the  same  cross  sectional  area  at  every  orientation.  The  reference  is  used  as  a 
known  area  in  the  digital  image  analysis  process  to  compare  and  determine  the  area  of  the  debris  piece.  This  is 
necessary  because  the  focal  length  and  distance  from  the  camera  to  object  may  change  from  image  to  image.  The 
change  in  focal  length  is  due  to  the  fact  that,  in  order  to  utilize  all  resolution  possible,  the  camera  is  zoomed  so  that 
the  reference  and  debris  piece  fill  the  image.  This  being  the  case,  as  the  area  seen  changes  the  amount  of  zoom 
changes  as  well.  The  debris  piece  and  reference  object  are  placed  on  the  top  of  two  thin,  black  stands.  The  stands 
are  located  at  the  center  of  the  area  in  which  images  will  be  taken.  The  stand  holding  the  debris  piece  remains  fixed 
in  one  location  while  the  stand  holding  the  reference  is  moved  about  the  debris  piece  to  ensure  that  the  reference  is 
always  located  to  the  left  of  the  debris  piece,  which  is  necessary  for  the  automated  version  of  the  image  analysis 
process. 

A  fairly  large  number  of  images  must  be  taken  of  the  debris  to  ensure  a  good  statistical  sample.  The  location  of  the 
digital  camera  for  the  images  must  simulate  all  possible  orientations  (looking  at  the  debris  piece  from  every  possible 
viewpoint)  for  the  average  area  to  be  correct.  A  geodesic  framework  is  used  to  generate  the  viewing  coordinates  so 
that  all  look  angles  are  given  an  equal  and  correct  weighting.  The  geodesic  model  of  choice  is  the  icosahedron,  as 
utilized  by  the  Aerospace  Corporation  in  their  model  for  a  uniform  satellite  fragmentation.’’  An  icosahedron  has  12 
vertices,  20  faces,  and  is  symmetrical  about  a  central  plane.  Because  of  the  symmetry  of  vertices,  only  the  top  half 
of  the  icosahedron  geometry  is  needed  with  the  debris  piece  and  reference  at  the  center.  The  reason  for  this  can  be 
seen  by  imagining  looking  at  your  hand  from  the  front  and  back;  both  sides  show  the  same  area,  although  their  exact 
surfaces  may  be  different.  By  dividing  the  faces  into  several  equilateral  triangles,  and  including  the  centroids  of  the 
triangles,  a  total  of  66  image  locations  can  be  attained.  However,  as  the  debris  piece  lies  at  the  center  of  the  virtual 
icosahedron, the points around the center are duplicated. For this reason, only 5 of the 10 points at the
center are used. Thus a total of 61 images can be taken. To create a more complete and accurate representation of
each debris piece, 61 images are taken at the above locations, the debris piece’s orientation is changed, and 61 more
images are taken. Also, 6 additional images are taken corresponding to three orthogonal viewing directions, front
and back, with the thought that the three orthogonal views may some day help in creating a 3-D model of the piece. Thus,
a total of 128 images are acquired for each debris piece.
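
The symmetry argument behind using only the top half of the icosahedron can be checked numerically. The sketch below builds only the 12 base vertices and keeps one vertex from each antipodal pair; it does not attempt to reproduce the face subdivision that yields the 66 (reduced to 61) camera locations described above, and the function names are hypothetical.

```python
import math

def icosahedron_vertices():
    """Return the 12 vertices of a regular icosahedron on the unit sphere."""
    phi = (1.0 + math.sqrt(5.0)) / 2.0      # golden ratio
    scale = math.sqrt(1.0 + phi * phi)      # normalizes each vertex to unit length
    raw = []
    for a in (-1.0, 1.0):
        for b in (-phi, phi):
            # cyclic permutations of (0, +/-1, +/-phi)
            raw += [(0.0, a, b), (a, b, 0.0), (b, 0.0, a)]
    return [tuple(c / scale for c in v) for v in raw]

def upper_half(vertices, tol=1e-12):
    """Keep one vertex per antipodal pair: first nonzero of (z, y, x) is positive."""
    kept = []
    for x, y, z in vertices:
        for c in (z, y, x):
            if abs(c) > tol:
                if c > 0:
                    kept.append((x, y, z))
                break
    return kept

vertices = icosahedron_vertices()
views = upper_half(vertices)

# Image count per debris piece as stated in the text: two sets of 61 plus 6 views.
IMAGES_PER_PIECE = 2 * 61 + 6
print(len(vertices), len(views), IMAGES_PER_PIECE)
```

Every vertex has an exact opposite, and a silhouette seen from a direction has the same area as the one seen from the opposite direction, so half of the directions carry all of the area information.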

The camera used to acquire the images is a Kodak DC120. It has a picture resolution of 1.2 million pixels, a CCD
resolution  of  836,400  pixels,  and  a  3X  zoom  lens  equivalent  to  a  35mm  camera.  A  10  MB  picture  card  is  used  to 
hold  all  pictures  in  one  shooting.  Once  the  pictures  have  been  taken  they  are  downloaded  for  analysis. 

Image  Analysis 

Once  the  images  are  downloaded  to  the  workstation,  Matlab  and  the  image  analysis  toolbox  are  used  to  process  the 
images.  An  extensive  program  was  written  in  Matlab  to  analyze  the  digital  images  with  as  much  automation  as 
possible.  Complete  automation  is  possible,  but  is  limited  by  the  quality  of  the  digital  images  themselves.  The 
images  which  cannot  be  processed  automatically  are  flagged  for  manual  analysis. 

There  are  several  steps  necessary  to  determine  the  area  of  an  object  by  comparing  it  to  a  reference  of  known  area. 
The  main  steps  involve  image  separation,  edge  detection,  and  pixel  counting.  Several  methods  for  each  of  the  steps 
were tested to determine the most accurate one. Figure 2 shows a comparison of the results
of using different threshold and area methods for one test object. The resultant areas are compared to results using a
planimeter. As seen, the maximum difference is less than five percent, which is considered acceptably close. The
final  process  uses  up  to  three  methods  of  determining  cross  sectional  area  to  further  reduce  any  errors  within  one 
method.  Total  processing  time  can  take  from  1  to  3  minutes  per  image,  depending  on  computer  speed  and  image 
size. 

The program can run in several automated modes or manual check modes. It can run all images at
once,  or  a  few  images  at  a  time  until  all  images  are  completed.  The  program  completes  a  series  of  steps  to  calculate 
the  area  of  a  debris  piece.  Once  the  process  to  be  used  has  been  defined  by  the  user,  the  images  are  read  in  and 
analyzed  one  at  a  time.  This  process  can  be  summarized  as: 

•  an  image  is  read  into  Matlab 

•  the  green  matrix  is  extracted 

•  a  threshold  level  is  chosen  to  create  a  binary  image 

•  the  reference  and  debris  are  separated  into  two  images 

•  an  edge  of  each  image  is  found 

•  the  pixels  within  each  image  are  counted 

•  a  best  fit  circle  is  created  from  the  reference  image  and  an  area  calculated  in  pixels 

•  pixel  values  are  compared  and  an  area  is  computed 
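
The steps above can be sketched on a tiny synthetic image. This is not the Matlab program described in this report; it is a simplified standard-library illustration with hypothetical function names. It assumes the image is a nested list of (R, G, B) tuples, that the reference lies entirely to the left of the debris with at least one empty pixel column between them, and it uses the raw reference pixel count in place of the best-fit circle step.

```python
def green_channel(image):
    """Extract the green matrix from an image of (R, G, B) tuples."""
    return [[px[1] for px in row] for row in image]

def to_binary(gray, threshold):
    """Pixels brighter than the threshold become 1 (object), others 0 (background)."""
    return [[1 if v > threshold else 0 for v in row] for row in gray]

def split_left_right(binary):
    """Split at the first empty column that follows the leftmost object."""
    width = len(binary[0])
    occupied = [any(row[c] for row in binary) for c in range(width)]
    first = occupied.index(True)          # left edge of the reference
    gap = occupied.index(False, first)    # first empty column after it
    left = [row[:gap] for row in binary]
    right = [row[gap:] for row in binary]
    return left, right

def count_pixels(binary):
    return sum(sum(row) for row in binary)

def debris_area(image, threshold, reference_area):
    """Scale the debris pixel count by the known area per reference pixel."""
    binary = to_binary(green_channel(image), threshold)
    reference, debris = split_left_right(binary)
    return reference_area * count_pixels(debris) / count_pixels(reference)

# Synthetic 10 x 20 black image: a 4 x 4 reference on the left, a 4 x 6 debris block.
image = [[(0, 0, 0) for _ in range(20)] for _ in range(10)]
for r in range(2, 6):
    for c in range(2, 6):
        image[r][c] = (255, 255, 255)
for r in range(3, 7):
    for c in range(10, 16):
        image[r][c] = (200, 200, 200)

print(debris_area(image, threshold=128, reference_area=2.0))  # 24/16 * 2.0 = 3.0
```

Because the result depends only on the ratio of pixel counts, the zoom and camera distance cancel out, which is the reason a reference of known area appears in every image.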

The final results from running all images consist of the average area, a list of all areas seen, and a list of normalized
areas  which  make  up  the  NRAF.  A  sample  of  an  image  analyzed  by  this  program  is  shown  in  Figure  3. 
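
As a rough illustration of how these outputs relate, the sketch below turns a list of per-image areas into an average, a list of normalized areas, and a frequency distribution. The normalization by the average area and the equal-width binning are assumptions made for this example; the exact NRAF definition is given in the results section and may differ. The area values are hypothetical.

```python
def area_statistics(areas, bins=10):
    """Return (average, normalized areas, relative frequency per bin).

    Assumptions: areas are normalized by their average, and frequencies are
    counted in equal-width bins from zero to the largest normalized area.
    """
    average = sum(areas) / len(areas)
    normalized = [a / average for a in areas]
    width = max(normalized) / bins
    counts = [0] * bins
    for value in normalized:
        index = min(int(value / width), bins - 1)  # largest value goes in last bin
        counts[index] += 1
    frequencies = [c / len(areas) for c in counts]
    return average, normalized, frequencies

# Hypothetical cross sectional areas from a handful of orientations.
sample_areas = [1.0, 2.0, 3.0, 4.0]
average, normalized, frequencies = area_statistics(sample_areas, bins=4)
print(average, normalized, frequencies)
```

Whatever binning is chosen, the normalized areas average to one and the frequencies sum to one, which makes distributions from fragments of different sizes directly comparable.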


[Figure 2 plot. Title: CU-6470 #9: % Difference of DIM From Planimeter For Threshold & Area Methods 1-4. Horizontal axis: Threshold Method. Legend: Area 1 through Area 6, Average.]

Figure  2:  Threshold  and  area  method  determination  for  debris  piece  CU-6470  #9. 


Figure  3:  Sample  image  of  reference  sphere  and  CU-6470  Piece  #2. 


Calibration 

To calibrate the DIM and compare results for useful relationships, the DIM observations of standard shapes (sphere,
cube, rod, and flat plate) were compared with their "ideal" characteristics. The "ideal" characteristics were
simulated with a program which uses a random number generator to create a large number of orientations for an
object. The area is determined from an equation describing the object's area at a given orientation. For ease
of calculation, 3000 random orientations are chosen and the area is calculated at each. An average area and
NRAF distribution can then be determined. Table 1 shows the resultant average area values and errors for the DIM
and for this idealized program for some standard shapes. The maximum error is less than ten percent.
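A random-orientation calculation of this kind can be sketched in a few lines. The example below is a hedged illustration for a cube only and is not the authors' program: the side length of 1.5 is an assumption chosen because it reproduces the "Actual" cube value of 3.3750 in Table 1, and the projected-area formula side^2 * (|nx| + |ny| + |nz|) is the standard result for a cube viewed along the unit vector n.

```python
import random


def random_unit_vector(rng):
    """Draw a viewing direction uniformly distributed over the sphere."""
    while True:
        x, y, z = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        norm = (x * x + y * y + z * z) ** 0.5
        if norm > 0:
            return x / norm, y / norm, z / norm


def cube_projected_area(side, direction):
    """Cross sectional area of a cube seen along the given unit vector."""
    nx, ny, nz = direction
    return side * side * (abs(nx) + abs(ny) + abs(nz))


def simulate_cube(side, n_orientations=3000, seed=0):
    """Return the average area and the list of areas over random orientations."""
    rng = random.Random(seed)
    areas = [cube_projected_area(side, random_unit_vector(rng))
             for _ in range(n_orientations)]
    return sum(areas) / len(areas), areas


# Assumed side length of 1.5 (hypothetical; see the lead-in).
mean_area, areas = simulate_cube(1.5)
```

For a convex body the orientation-averaged cross section equals one quarter of the surface area, which for this cube is 6 * 1.5^2 / 4 = 3.375, so the simulated mean should land close to that value.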


Shape        DIM      Ideal    Actual   % Error DIM   % Error Ideal
Sphere       1.7398   1.7294   1.7294   0.5993        0.0009
Cube         3.3159   3.3613   3.3750   1.7517        0.4051
Rod          1.0670   0.9831   1.0201   4.5968        3.6253
Flat Plate   7.2305   8.0593   8.0000   9.6183        0.7416

Table 1: Average area comparison for DIM, ideal program, and actual value.

The areas of several shapes were determined and compared against actual areas or areas determined by a planimeter
to  determine  common  errors  for  individual  images.  The  results  are  shown  in  Table  2.  As  can  be  seen  the  greatest 
error  or  difference  is  less  than  ten  percent. 


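Shape         DIM      Actual       % Error
Sphere        1.683    1.729        2.674
Rod           (values lost; surviving entry: 6.332)
Cylinder      (values lost; surviving entry: 1.732)
Flat Plate    16.048   16.000       0.301

Shape         DIM      Planimeter   % Difference
P4            3.843    3.529        8.911
CU-6470 #9    (one value lost; surviving entries: 4.883, 4.031)
G2            (one value lost; surviving entries: 1.387, 2.084)

Table 2: Individual image analysis of several objects.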

The  NRAF  for  the  image  analysis  program  can  be  compared  to  the  ideal  mathematical  model  as  another  means  of 
calibration.  A  comparison  for  a  flat  plate  is  shown  below  in  Figure  4.  The  results  are  seen  to  agree  quite  well. 

[Figure 4 plot. Title: NRAF For A Flat Plate.]

Figure 4: Normalized relative area frequency (NRAF) distribution




Results 


A number of objects were analyzed using the DIM. The average area and area distributions are examined here.
These results are compared to results from an earlier method, to those measured with a planimeter, and to some
characteristic shapes and simple models.

Average  Area  Distribution 

The  average  area  for  several  debris  pieces  was  found  using  the  DIM,  shadowgraph,  and  planimeter  methods.  These 
methods  compute  the  area  of  a  debris  piece  at  several  orientations.  The  mean  is  then  computed  for  the  respective 
debris.  Figure  5  shows  the  measured  average  area  versus  diameter  for  the  fragments  examined.  The  diameter  is  the 
characteristic  dimension  as  discussed  earlier. 

As can be seen in Figure 5, there appears to be a fairly regular relationship between the "characteristic" dimension
and the average area for the CU-6470 debris fragments. Several types of relationships have been fit to this data. It
is thought that a power law should provide the most meaningful fit to the data. The power law equation becomes:
Area = 0.859*(diameter)^[exponent illegible], where the area is in cm^2 and the diameter is in cm. The log-linear data
space shown in Figure 6, however, suggests that different relationships may apply depending on the size
regime of the object. Further analysis of more fragments is necessary to fully understand the relationship and its
variation.
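A power law of the form Area = k * diameter^p is commonly fit by a straight-line least-squares fit in log-log space. The sketch below illustrates that technique under stated assumptions: the fitting method is not confirmed by the text, the function name is hypothetical, and the data points are synthetic values made up for the example, not CU-6470 measurements.

```python
import math


def fit_power_law(diameters, areas):
    """Least-squares fit of log(area) = log(k) + p * log(diameter)."""
    xs = [math.log(d) for d in diameters]
    ys = [math.log(a) for a in areas]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx
    coefficient = math.exp(mean_y - exponent * mean_x)
    return coefficient, exponent


# Synthetic data generated from k = 0.8 and p = 1.9 (illustrative only).
diameters = [0.5, 1.0, 2.0, 4.0, 8.0]
areas = [0.8 * d ** 1.9 for d in diameters]
k, p = fit_power_law(diameters, areas)
```

Fitting separate lines to the small and large fragments in the same way would be one simple check of whether the relationship changes with size regime.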


Figure 5: Measured average cross sectional areas versus diameter for CU-6470 objects.

Figure 6: Log of measured average cross sectional areas versus diameter for CU-6470 objects.

Figure 7: Log of estimated and measured average cross sectional area versus diameter for CU-6470 objects.

From a modeling point of view it is desirable to have a simple relationship between fragment dimension and
average cross section as found earlier. Often the cross sectional area has been estimated by assuming that the debris
object is a sphere with diameter equal to the characteristic dimension. Figure 7 shows estimated average areas
based on the characteristic dimensions along with the measured average areas. It can be seen that the classic
assumption yields reasonable estimates for the cross sectional area for all but the largest objects. The only shape
that appears to work better is the polyhedral shape. The estimated average areas come from three assumptions: a
sphere with the characteristic dimension as the diameter, a cube with sides equal to the measured
dimensions, and a polyhedron with the measured dimensions representing the main axes. The polyhedral shape can
be thought of as two rhombus based pyramids attached at the base. It can be seen that the measurements generally
lie between the sphere and polyhedral shape values. A spheroid shape is slightly better than the spherical model, but
not enough to justify its increased complexity.

Normalized  Relative  Area  Frequency  Distribution 

The  normalized  relative  area  frequency  (NRAF)  distributions  for  several  objects  from  the  CU-6470  shot  have  been 
determined.  A  non-dimensional  relative  area  is  found  by  dividing  each  measured  area  by  the  average  area  for  that 
piece.  The  relative  area  distribution  curve  is  normalized  so  that  the  area  under  the  curve  is  equal  to  one.  The  NRAF 
can  then  be  compared  for  all  pieces  regardless  of  the  actual  area. 
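The normalization described above can be sketched as a simple histogram calculation. This is an illustrative Python version under stated assumptions: the function name is hypothetical, and the bin width of 0.10 is assumed from the "Relative Area (0.10)" axis label on the NRAF plots.

```python
def nraf(areas, bin_width=0.10):
    """Return the mean area and the normalized relative area frequencies."""
    mean_area = sum(areas) / len(areas)
    relative = [a / mean_area for a in areas]

    # One bin per bin_width of relative area, up to the largest value seen.
    n_bins = int(max(relative) / bin_width) + 1
    counts = [0] * n_bins
    for r in relative:
        counts[int(r / bin_width)] += 1

    # Divide by N * bin_width so the area under the curve equals one.
    density = [c / (len(areas) * bin_width) for c in counts]
    return mean_area, density


# Made-up areas for one piece at four orientations (illustrative only).
sample_areas = [1.0, 2.0, 3.0, 2.0]
mean_area, density = nraf(sample_areas)
```

Since every piece is divided by its own average area, the resulting curves can be overlaid for pieces of very different size.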

The NRAF distributions are interesting in that they appear to depend on the "shape" of the object. Figures 8 to 10
show the idealized (theoretical) NRAF distributions and the DIM measured NRAF distributions for several idealized
shapes: sphere, cube, and flat plate. The "ideal" distributions come from a Monte-Carlo simulation using 3000
random orientations. The DIM distributions are slightly spread out with respect to the ideal distributions due to the
small errors in estimating the area.


Figure 8: Idealized and measured NRAF distributions for a sphere.

Figure 9: Idealized and measured NRAF distributions for a cube.

Figure 10: Idealized and measured NRAF distributions for a flat plate. (Horizontal axis: Relative Area (0.10).)


Shape  Factor 

The shape of debris affects the lethality, decay, and ability to observe debris objects. A preliminary shape factor
derived from the NRAF and aspect ratio distributions has been developed. The aspect ratio is the ratio of the
characteristic dimensions of an image. These factors may help in modeling the lethality, decay and observability of
fragments and will be explained in this section.

Figure 11 compares the NRAF distribution of CU-6470 Piece #9 to that of a flat plate. It can be seen that the
distributions are similar. The differences at the ends are easily explained by the slight curvature of Piece #9, which
is shown in Figure 1. This slight curvature means that the "edge on" images will always have a greater area than the
edge of an idealized flat plate. Therefore there will be no small area measurements and the average area will be
slightly higher than for a flat plate. This slightly higher average and the slight curvature also mean that there will be
few observations of large areas.


Figure 11: NRAF distribution for CU-6470 Piece #9 and DIM results for a flat plate.

This can be contrasted to the NRAF distribution shown in Figure 12 for Piece #6, which is crumpled. This more
"shapely" object has a distribution more like a cube or a rod. Figure 13 compares the NRAF for objects that had
highly twisted shapes ("shapely" pieces such as 2, 4, 6, and a sphere) with those pieces that were more plate-like
("flatter" pieces 9, 12, and flat plate). The data for this figure was taken from DIM data for the specified objects.
The NRAF distributions differ depending on the shape of the object. This makes sense when compared with the
NRAF for the ideal shapes.

Figure 12: NRAF for a crumpled piece, CU-6470 #6


[Figure 13 plot. Title: Average NRAF For Several Objects. Horizontal axis: Relative Area (0.10).]

Figure 13: NRAF for "Shapely" and "Flatter" objects

The proposed "Shape Factor" based on the NRAF is the standard deviation of the frequency values. The more
spherical an object becomes, the lower the standard deviation is, since the object has the same cross section from
every orientation. Conversely, the more like a flat plate the object becomes, the larger the standard deviation
becomes. This also agrees with the "Shape Factor" based on the characteristic dimensions for the pieces and with visual
observations. The characteristic dimension shape factor is based on the difference ratios between the characteristic
dimensions. The difference ratios are defined as: A/C = (A-C)/A; B/C = (B-C)/B. A, B, and C are the 3
dimensions measured for the piece, from longest to smallest. As these ratios are defined, it is obvious that a sphere
will have difference ratios always equal to zero and a flat plate will have A/C and B/C difference ratios approaching 1,
since the thickness of the plate is assumed to be small. The two methods for determining the qualitative shape are
compared in Figure 14. The grouping of the objects from flatter to more shapely can also be seen in Figure 15,
which plots the B/C and A/C difference ratios. When these two ratios are multiplied by one another, the two shape
types are spread out well on a linear scale. This product of the A/C and B/C difference ratios appears to be a
good method to discriminate between the apparent shapes of debris, and it corresponds well to the NRAF of the
actual debris piece. However, the characteristic dimensions are only useful for ground based fragments, as the
on-orbit fragments are not easily measured.
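The difference ratios and their product are simple to compute. The following Python sketch uses a hypothetical function name and made-up dimensions; it only illustrates the definitions A/C = (A-C)/A and B/C = (B-C)/B given in the text.

```python
def difference_ratios(dimensions):
    """Return the A/C ratio, the B/C ratio, and their product.

    The three dimensions are sorted so that A >= B >= C.
    """
    a, b, c = sorted(dimensions, reverse=True)
    ratio_ac = (a - c) / a
    ratio_bc = (b - c) / b
    return ratio_ac, ratio_bc, ratio_ac * ratio_bc


# A sphere-like piece: all three dimensions equal.
sphere_like = difference_ratios([2.0, 2.0, 2.0])

# A plate-like piece: thickness much smaller than length and width.
plate_like = difference_ratios([4.0, 4.0, 0.04])
```

The sphere-like piece gives a product of zero and the thin plate gives a product close to one, which is the spread between "shapely" and "flatter" pieces that the product is meant to expose.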

The characteristic dimensions and the NRAF distribution for the debris pieces are included in the appendix.

Figure 14: Comparison of shape factor parameters for CU-6470 pieces

Figure 15: Difference ratio parameters for CU-6470 pieces (plot title: Difference Ratio B/C to A/C)


Summary  /  Conclusions 

The purposes of this research have been to examine methods for determining the cross sectional area of debris
objects and to analyze the obtained data. For laboratory pieces, the area can be measured using several different
techniques.  A  digital  camera  is  used  to  image  the  debris  piece  and  a  known  reference  object  together  at  a  large 
number  of  orientations.  The  area  of  the  object  at  each  orientation  can  then  be  estimated  by  analyzing  the  digital 
images,  or  by  using  a  planimeter.  This  will  give  an  average  area  as  well  as  a  distribution  of  areas.  The  main 
advantage  of  the  image  analysis  method  is  that  the  area  at  each  orientation  can  be  determined  fairly  quickly  and 
accurately. 

The results show that there appears to be a relationship between average area and characteristic dimensions. Also,
the normalized relative area frequency (NRAF) distribution as well as object aspect ratios may describe the shape of
an object. Both of these results could lead to improvements in the modeling of debris fragments and the debris
environment. Shape and area especially affect decay rates, penetration analysis, and reduction of observational
data. Further data and study are required to clarify these observations.

Appendix


•  Characteristic  dimensions,  area  and  shape  factors  for  the  CU-6470  debris 

•  Normalized relative area frequency (NRAF) distributions for the CU-6470 debris.

A Look At Characteristic Dimensions & Dimensions in cm

Dimension Fractions A/B = (A-B)/A

ABSTRACT

The  current  performance  issues  related  to  a  single  receiver 
Synthetic  Aperture  Radar  (SAR)  were  studied  and  illustrated. 
Specifically,  the  tradeoffs  between  swath,  resolution  and 
ambiguity  in  both  the  across-track  and  along-track  directions  were 
considered  and  documented.  To  improve  upon  current  performance, 
it  is  proposed  that  the  SAR  be  modeled  in  a  multiple  receiver 
environment.  To  physically  illustrate  the  current  situation  and 
proposed  solution,  a  SAR  simulator  was  constructed  using  the  C 
programming  language.  A  square  image  was  used  to  represent  a 
target  area.  A  single  receiver  SAR  environment  was  developed  and 
the  data  obtained  clearly  demonstrated  the  shortcomings  of  the 
present  system.  Subsequently,  a  multiple  receiver  SAR  environment 
was  created  and  the  data  collected  has  proven  that  a  multiple 
receiver  SAR  environment  produces  higher  performance  with  respect 
to  swath  width,  resolution  and  ambiguity.  Specifically,  it  has 
been  shown  that  in  a  multiple  receiver  SAR  environment,  it  is  now 
possible  to  obtain  wide  swath,  high  resolution,  and  unambiguous 
SAR images within a wide range of pulse repetition frequencies.

WIDE SWATH, HIGH RESOLUTION, UNAMBIGUOUS
SYNTHETIC APERTURE RADAR (SAR) IMAGING
USING DIGITAL BEAMFORMING ARRAYS


James M. Stiles


1  INTRODUCTION 
1.1 Overview

Synthetic  Aperture  Radar  [1]  is  a  powerful  surveillance  tool 
that  can  perform  a  multitude  of  functions  in  an  all-weather  day 
and  night  situation.  In  the  military  area,  SAR  may  be  used  as  a 
ground  surveillance  system  to  monitor  troop  and  equipment 
activity.  In  the  civilian  area,  SAR  is  a  popular  choice  for 
weather  monitoring,  ocean  surveillance  and  deforestation 
assessment.

1.2  SAR  Imaging  Performance 

There  are  two  critical  criteria  that  a  SAR  needs  to  fulfill 
in  order  to  perform  optimally.  Firstly,  it  must  be  able  to  map  a 
wide  spatial  area  in  order  to  locate  all  objects  of  interest.  In 
addition,  it  must  also  be  able  to  exhibit  fine  resolution  so  as  to 
specifically  identify  particular  objects  on  the  ground. 
Unfortunately,  these  two  criteria  are  not  easily  achieved  at  the 
same  time  because  of  a  tradeoff  that  is  currently  inherent  in 
standard SAR sensors: observations of high spatial resolution are
achieved  unambiguously  only  when  the  width  of  the  observed  area  is 
limited.  Given  that  the  converse  of  the  previous  statement  is 
also  true,  it  is  now  clear  that  this  limitation  could  seriously 
inhibit  the  ability  of  a  standard  SAR  sensor  to  produce 
unambiguous, wide swath, and high resolution images.

1.3 Ongoing Research

There has been much research performed in this area to
attempt to limit the impact of the above restriction. Possible
techniques for obtaining wide swath SAR images are presented in
[2] and [3]. These references primarily focus on techniques now
referred to as Multiple Elevation Beam (MEB) and Multiple Azimuth
Beam (MAB). Whilst these techniques are effective, they each
possess shortcomings. With MEB SAR, we are unable to obtain a
continuous swath in the across-track direction due to the fact
that the sensor is unable to receive whilst transmitting. With
MAB SAR, there remain unresolved ambiguities in the doppler
direction due to overlap of the sidelobe beams with that of the
mainlobe. Yet another technique [4] involving the use of a
broader elevation-plane beamwidth appears to be computationally
intensive. Finally, efficient filtering methods [5] to improve
upon the resolution of SAR images have also been offered, but (as
they use a variation of MAB) assume that ambiguities have been
resolved. In addition, in all the cases mentioned above,
techniques and theories presented have not been physically
illustrated with a SAR imaging simulator.

1.4 Aim

The  aim  of  this  project  is  to  discuss  and  validate  the 
concepts  leading  to  the  development  of  a  SAR  model  that  utilizes 
the  concept  of  multiple  receive  apertures  and  digital  beamforming 
techniques  [6,7]  to  produce  unambiguous,  wide  swath,  and  high 
resolution SAR images.

2 DISCUSSION OF PROBLEM


2.1  Current  Situation 

Figure 2.1 shows the basic geometry of the SAR model. An
airborne SAR traveling at a speed of v, and at a height h above
the ground, will image over a time T a total area of $R_g v T$, where $R_g$
is the ground swath coverage in the direction of range.


Figure 2.1: Basic Geometry of the SAR Model

We shall say that this imaged area, A, has an along-track
length of $l_x$ and an across-track length of $l_y$. Therefore,

$$A = l_x l_y = R_g v T. \tag{2.1}$$

A standard SAR sensor with a single receiver of bandwidth B
will amass a total of BT independent samples over an observation
time of T seconds. With this data, an unambiguous image of BT
pixels may now be constructed.

The SAR sensor must not illuminate more than BT resolution
cells if we wish to avoid ambiguity. If there are more than BT
pixels illuminated, then the scattered energy from the extra area
will appear in pixels of the imaged area, thus causing ambiguity.
We are at liberty to develop a large image of moderate resolution,
or a smaller image of high resolution. But in both cases, we
remain constrained by the time-bandwidth product so long as we
wish to produce an unambiguous image.

To  avoid  ambiguities,  the  pulse  repetition  frequency  (prf) 
has  to  be  carefully  chosen.  We  require  a  low  pulse  repetition 
frequency  to  avoid  range  ambiguities.  At  the  same  time,  a  high 
pulse  repetition  frequency  is  required  to  eliminate  ambiguity  in 
the  doppler  direction.  If  the  SAR  antenna  beamwidth  illuminates 
more  than  BT  resolution  cells  over  an  observation  time  T,  then  no 
value  of  the  pulse  repetition  frequency  can  be  found  to  produce  an 
unambiguous image.


2.2  Proposed  Solution 

If, however, we were now to implement N receive apertures at
the antenna, then the total number of independent samples
available would increase to NBT. This now enables us to obtain an
image with either greater resolution or larger size. As an
example, let us consider the implementation of a three receive
aperture antenna array. A graphical illustration of this is shown
in Figure 2.2. We observe the existence of a low transmit pulse
repetition frequency to accommodate a wide swath width. This
results in the synthesized aperture being inadequately sampled.
However, the spatially displaced receive apertures now provide the
additional samples and, in essence, enable the synthesized receive
aperture to now exhibit spatial samples of a SAR with N times the
pulse repetition frequency.


[Figure 2.2 diagram. Labels: receive, transmit, receive; Antenna Aperture; Transmit Spatial Sampling; Receive Spatial Sampling.]

Figure 2.2: Spatial Sampling of the SAR Multi-Aperture Model

Since the single transmit element shares an aperture with
the center receive element, we can now say that the along-track
illumination pattern of each of the receive elements and the
single transmit element are identical and will have a beamwidth
three times larger than the original single receive aperture.
When added correctly, the azimuthal response of these three
sub apertures will be equivalent to that of a single larger
receive aperture. This now provides us with a beamwidth narrow
enough to eliminate along-track ambiguity.

The three element receiver is essentially implemented as a
phased array. Since each element is individually and coherently
detected, digital beamforming may now be used in the processing of
data. Because of the broader beamwidth of each sub aperture
(explained above), a single target may be observed long enough to
obtain a fine along-track resolution. At the same time, the
coherent adding of the receiver channels will now ensure that
there will be a sufficiently narrow beam in the doppler direction
to eliminate along-track ambiguities.

2.3  Multiple  Receiver  Antenna  Design 

The geometry of the radar antenna with respect to the object
coordinate axes is shown as Figure 2.3. We should note here that
the radar antenna axes do not have the same orientation as the
object coordinate axes. The radar antenna is tilted towards the
ground such that the normal axis out of the plane of the radar
antenna, z', makes an angle $\theta$ with the z direction of the object
coordinate axis.


Figure 2.3: Radar Antenna Coordinate Geometry

Specifically, the relationships between the radar antenna
coordinate axes and the object coordinate axes may be stated as
follows:

$$x' = x, \tag{2.2}$$

$$y' = (-\cos\theta)\,y - (\sin\theta)\,z, \tag{2.3}$$

and

$$z' = (\sin\theta)\,y - (\cos\theta)\,z. \tag{2.4}$$
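Equations (2.2) to (2.4) describe a rigid rotation about the along-track axis, which can be checked numerically. The sketch below is illustrative only; the function name and the sample point are hypothetical.

```python
import math


def object_to_antenna(x, y, z, theta):
    """Apply equations (2.2)-(2.4): object coordinates to antenna coordinates."""
    cos_t = math.cos(theta)
    sin_t = math.sin(theta)
    x_prime = x
    y_prime = -cos_t * y - sin_t * z
    z_prime = sin_t * y - cos_t * z
    return x_prime, y_prime, z_prime


# Sample point one unit along-track and one unit below the origin.
theta = math.radians(45.0)
xp, yp, zp = object_to_antenna(1.0, 0.0, -1.0, theta)

# A rotation preserves length, so both norms should agree.
norm_before = math.sqrt(1.0 ** 2 + 0.0 ** 2 + (-1.0) ** 2)
norm_after = math.sqrt(xp ** 2 + yp ** 2 + zp ** 2)
```

With the tilt of 45 degrees, a point directly below the radar maps to equal positive y' and z' components.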

The  multiple  receive  aperture  concept  was  illustrated  in 
Figure  2.2  for  a  three  receive  aperture  system.  In  this  project, 
the  simulation  will  ultimately  move  towards  a  nine  receive 
aperture  model.  In  that  case,  a  possible  design  could  be  that 
shown  in  Figure  2.4. 

We  notice  here  that  the  nine  receive  apertures  are  laid  out 
in  a  three  by  three  orientation.  We  should  also  recognize  that  it 
would  be  also  possible  to  orientate  the  apertures  in  a  nine  by  one 
or  a  one  by  nine  orientation  as  well.  The  decision  on  the 
orientation  of  the  apertures  will  depend  on  the  selection  of  the 
pulse  repetition  frequency.  A  choice  of  a  high  pulse  repetition 
frequency  would  require  us  to  orientate  the  apertures  in  a  one  by 
nine  orientation  (in  the  across-track  direction)  so  as  to 
eliminate  ambiguity  in  the  direction  of  range.  Similarly,  a  low 
pulse  repetition  frequency  selection  will  require  that  the 
apertures  be  lined  up  in  a  nine  by  one  orientation  (in  the  along- 
track  direction)  to  eliminate  ambiguity  in  the  doppler  direction. 

It is important to note here that all elements in the array
are of the same size, i.e. they each have a length $L_x$ and a width
$L_y$. The single transmitter is also located at the center of the
array. The position vector $r_i$ represents the distance and
direction from the center of the single transmit element to the
center of any of the multiple receive elements. Whereas the
conventional approach has been to design a single
transmitter-receiver unit, we now instead propose implementing an antenna
array that maintains a single transmitter but, in addition,
possesses multiple receive elements oriented in a chosen manner.


Figure 2.4: Nine Receive Element SAR Antenna Design

Once again, we note that the x' axis corresponds to the
along-track direction and the projection of the y' axis
corresponds to the across-track direction. This entire antenna is
tilted to face the ground in such a manner that its normal will
make an angle of $\theta$ with the height axis of the conventional axes.

3 SOLUTION METHODOLOGY


3.1  Parameter  Design 


Given that the SAR has an operating wavelength $\lambda$, its
along-track pixel resolution can be determined from [8] to be

$$\Delta x = \frac{\lambda R}{2 v T}. \tag{3.1}$$

Since the slant range R is defined as

$$R = \frac{h}{\cos\theta}, \tag{3.2}$$

equation (3.1) may now be written as

$$\Delta x = \frac{\lambda h}{2 v T \cos\theta}. \tag{3.3}$$

Given that c is the speed of light, the across-track pixel
resolution of the SAR may also be found from [8] to be

$$\Delta y = \frac{c}{2 B \sin\theta}. \tag{3.4}$$

We set a standard requiring square pixels. Therefore

$$\Delta x = \Delta y. \tag{3.5}$$

Subsequently, by equating and rearranging terms in equations
(3.3) and (3.4), we can thus arrive at

$$\frac{B}{T} = \frac{c v \cot\theta}{h \lambda}. \tag{3.6}$$

Making the mathematical assumption that

$$\cot\theta = 1, \tag{3.7}$$

and using the relationship between $\lambda$ and the radar frequency, f,

$$\lambda = \frac{c}{f},$$

we can simplify the time-bandwidth ratio in equation (3.6) to be

$$\frac{B}{T} = \frac{v f}{h}. \tag{3.8}$$


Based on typical radar design parameters obtained from [9],
we set the radar height, speed, operating frequency and antenna
tilt angle to be:

Radar Height:     $h = 183\ \mathrm{km}$,      (3.9)

Radar Speed:      $v = 7.8\ \mathrm{km/s}$,    (3.10)

Radar Frequency:  $f = 10\ \mathrm{GHz}$,      (3.11)

and

Tilt Angle:       $\theta = 45^\circ$.         (3.12)

Using the above values, we can next numerically determine
the time-bandwidth ratio in equation (3.8) to be

$$\frac{B}{T} = 4.26 \times 10^{8}\ \mathrm{s^{-2}}. \tag{3.13}$$


We select a square image containing 256 pixels in both
length and width. If our aim is to reconstruct this image
unambiguously with nine apertures, then

$$9BT > 256 \times 256 = 65536. \tag{3.14}$$

This implies that

$$BT > 7281. \tag{3.15}$$

Since equation (3.15) merely stipulates a lower bound for
the time-bandwidth product, we are able to specify a higher value
as well. Let us say that

$$BT = 10650. \tag{3.16}$$

Solving equations (3.13) and (3.16) simultaneously, we can
determine that

$$B = 2.13\ \mathrm{MHz}, \tag{3.17}$$

and also that

$$T = 5\ \mathrm{ms}. \tag{3.18}$$
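The arithmetic behind (3.13) to (3.18) can be reproduced with a short script. This is a sketch under stated assumptions: the variable names are hypothetical, and the ratio is computed as B/T = v f / h, i.e. equation (3.6) with cot(theta) = 1 and wavelength = c / f.

```python
import math

# Design parameters quoted in (3.9)-(3.11), converted to SI units.
h = 183e3    # radar height, m
v = 7.8e3    # radar speed, m/s
f = 10e9     # radar frequency, Hz

# Time-bandwidth ratio B/T in s^-2, compare with (3.13).
ratio = v * f / h

# Lower bound on BT for a 256 x 256 image with nine apertures, (3.14)-(3.15).
bt_lower_bound = 256 * 256 / 9

# Chosen time-bandwidth product, (3.16).
bt = 10650

# Solve B/T = ratio and B*T = bt for B and T, compare with (3.17)-(3.18).
bandwidth = math.sqrt(bt * ratio)    # Hz
obs_time = math.sqrt(bt / ratio)     # s
```

Multiplying and dividing the two relations isolates B^2 and T^2, which is why a square root of each appears in the last two lines.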
With these values obtained, we can confirm that

$$\Delta x = \Delta y \approx 100\ \mathrm{m}. \tag{3.19}$$

This is now in accordance with equation (3.5).
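As a numerical check of this result, the two resolutions can be evaluated from the standard forms along-track = wavelength * h / (2 v T cos(theta)) and across-track = c / (2 B sin(theta)). The variable names below are hypothetical, and c is rounded to 3.0e8 m/s as an assumption.

```python
import math

c = 3.0e8                    # speed of light, m/s (rounded)
h = 183e3                    # radar height, m
v = 7.8e3                    # radar speed, m/s
f = 10e9                     # radar frequency, Hz
theta = math.radians(45.0)   # tilt angle
bandwidth = 2.13e6           # B from (3.17), Hz
obs_time = 5e-3              # T from (3.18), s

wavelength = c / f

# Along-track and across-track pixel resolutions, in metres.
delta_x = wavelength * h / (2 * v * obs_time * math.cos(theta))
delta_y = c / (2 * bandwidth * math.sin(theta))
```

Both values come out just under 100 m, so the square-pixel requirement holds to within the rounding of B and T.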

To eliminate range ambiguities, we can determine from [8]
that the prf is constrained by:

$$f_{prf} \leq \frac{c}{2 l_y \sin\theta}. \tag{3.20}$$

To eliminate doppler ambiguities, however, we also require
from [8] that

$$f_{prf} \geq \frac{2 v l_x}{\lambda R}. \tag{3.21}$$

Equations  (3.20)  and  (3.21)  clearly  illustrate  once  again 
the  current  difficulty  faced  in  reducing  ambiguity  in  both  the 
range  and  doppler  directions. 

Rearranging (3.20) and (3.21), we can arrive at

$$l_y \leq \frac{c}{2 f_{prf} \sin\theta} \tag{3.22}$$

and

$$l_x \leq \frac{\lambda R f_{prf}}{2 v}. \tag{3.23}$$

The maximum area may now be determined by setting the values
of $l_x$ and $l_y$ to their individual maximum values. Therefore, the
maximum area may be found by taking the product of equations
(3.22) and (3.23) at the maximum values of $l_x$ and $l_y$.
Specifically,

$$l_x l_y = \frac{c \lambda R}{4 v \sin\theta}. \tag{3.24}$$

Substituting (3.2) into (3.24), we can obtain

$$l_x l_y = \frac{c \lambda h}{4 v \sin\theta \cos\theta}. \tag{3.25}$$

Replacing (3.25) with its numerical values,

$$l_x l_y = 105.6 \times 10^{6}\ \mathrm{m^2}. \tag{3.26}$$

Since the image is square,

$$l_x = l_y = 10.27\ \mathrm{km}. \tag{3.27}$$
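The single-receiver limit can be checked numerically. The sketch below assumes the maximum area takes the form c * wavelength * h / (4 v sin(theta) cos(theta)), which is the product of the range and doppler bounds with R = h / cos(theta); the variable names are hypothetical and c is rounded to 3.0e8 m/s.

```python
import math

c = 3.0e8                    # speed of light, m/s (rounded)
h = 183e3                    # radar height, m
v = 7.8e3                    # radar speed, m/s
f = 10e9                     # radar frequency, Hz
theta = math.radians(45.0)   # tilt angle

wavelength = c / f

# Maximum unambiguous area for a single receiver, in square metres.
max_area = c * wavelength * h / (4 * v * math.sin(theta) * math.cos(theta))

# Side length of the equivalent square image, in metres.
side = math.sqrt(max_area)
```

Note that the pulse repetition frequency cancels in the product, so the single-receiver area limit cannot be raised by any choice of prf.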

In our proposed solution design, we now specify the presence
of nine receive apertures instead of just one. Therefore, we now
expect our maximum area to be nine times larger. Given that the
pixel resolution in both the range and doppler directions is
approximately 100 m, we may now set the pixel length in both range
and doppler directions to be, say, 105 m. Therefore, with 256 pixels
on a side, the new image length and width is now

$$l_x = l_y = 26.88\ \mathrm{km}. \tag{3.28}$$

Finally, we are now able to return to equation (3.20) and
calculate that in order to eliminate range ambiguities,

$$f_{prf} < 7891\ \mathrm{Hz}. \tag{3.29}$$

Once again, as equation (3.29) is merely an upper bound, we
can specify that in order to eliminate range ambiguities,

$$f_{prf} = 7\ \mathrm{kHz}. \tag{3.30}$$

Similarly, we are now also able to return to equation (3.21)
and calculate that in order to eliminate doppler ambiguities,

$$f_{prf} > 54009\ \mathrm{Hz}. \tag{3.31}$$

As equation (3.31) also stipulates a lower bound, we can
next say that in order to eliminate doppler ambiguities,

$$f_{prf} = 60\ \mathrm{kHz}. \tag{3.32}$$
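The two prf bounds can be reproduced from the design values. This sketch assumes the range bound has the form c / (2 l_y sin(theta)) and the doppler bound the form 2 v l_x / (wavelength * R) with R = h / cos(theta); the variable names are hypothetical and c is rounded to 3.0e8 m/s.

```python
import math

c = 3.0e8                    # speed of light, m/s (rounded)
h = 183e3                    # radar height, m
v = 7.8e3                    # radar speed, m/s
f = 10e9                     # radar frequency, Hz
theta = math.radians(45.0)   # tilt angle
image_side = 26.88e3         # l_x = l_y from (3.28), m

wavelength = c / f
slant_range = h / math.cos(theta)

# Upper bound on the prf from range ambiguity, in Hz.
prf_range_max = c / (2 * image_side * math.sin(theta))

# Lower bound on the prf from doppler ambiguity, in Hz.
prf_doppler_min = 2 * v * image_side / (wavelength * slant_range)

# No single prf satisfies both bounds when the lower one exceeds the upper.
conflict = prf_doppler_min > prf_range_max
```

The doppler bound is roughly seven times the range bound, which is the conflict that the multiple receive aperture design is intended to remove.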


The along-track aperture length may now be theoretically
calculated as

$$L_x = \frac{\lambda h}{l_x \cos\theta} \approx 29\ \mathrm{cm}. \tag{3.33}$$

Similarly, the across-track aperture width may now be
theoretically found to be

$$L_y = \frac{\lambda h}{l_y \cos^2\theta}. \tag{3.34}$$

However, an alternate design method is to find the values of $L_x$ and $L_y$ such that the nearest point of ambiguity in both the range and doppler directions lies within the location of the first null of the radar antenna pattern. Using this method, we find that

$L_x = 25.5$ cm,  (3.35)

and also that

$L_y = 37.1$ cm.  (3.36)

This  concludes  the  parameter  design  phase  of  the  simulation. 
At  this  point,  we  have  all  necessary  design  parameters  to  simulate 
several  different  situations. 

We  notice  that,  as  expected,  there  are  conflicting 
requirements  for  the  choice  of  a  single  radar  pulse  repetition 
frequency.  We  are  required  to  select  a  value  of  60KHz  for  the 
pulse  repetition  frequency  if  we  wish  to  eliminate  ambiguities  in 
the  doppler  direction.  But,  we  are  also  required  to  select  a 
value  of  7KHz  for  the  pulse  repetition  frequency  if  we  wish  to 
eliminate ambiguities in range. This conflicting dependency on the pulse repetition frequency is the problem that we anticipate the multiple receive aperture SAR model will be able to resolve.

3.2 A Numerical Example

With  the  values  obtained,  it  is  now  appropriate  to  develop  a 
numerical  example  to  physically  illustrate  the  present  problem  and 
the  proposed  solution. 

It  should  be  noted  here  that  the  design  methodology  in  the 
previous  section  centers  on  a  radar  with  unfocused  apertures, 
whereas  in  reality  the  SAR  is  often  operated  with  fully  focused 
apertures. This was done only to reduce computation time in the simulation. The problem of obtaining wide swath, unambiguous, high resolution radar images remains the same, regardless of the type of aperture used. To demonstrate this, the following
numerical  example  will  use  the  derived  values  to  illustrate  the 
defined  problem  in  the  case  of  a  focused  SAR. 

In the case of a focused SAR, the along-track pixel resolution is defined as

$\Delta x = \dfrac{l}{2}$,  (3.37)

where $l$ is the length of the antenna.

Using the value for $l$ found in (3.35), we can say that

$\Delta x = 0.1275$ m.  (3.38)

Next, assuming square resolution cells,

$\Delta y = \Delta x = 0.1275$ m.  (3.39)

Returning to equation (3.4), we can find the bandwidth to be

$B = 1.67 \times 10^{9}$ Hz.  (3.40)

Similarly, we may next return to equation (3.1) and determine the new observation time to be

$T = 3.9$ s.  (3.41)

We can see now that a fully focused SAR operating in our defined environment would actually have a bandwidth of 1.67GHz and a (longer) observation time of 3.9s. This will only result in a larger time-bandwidth product than that calculated earlier. The qualitative treatment of the problem from this point on remains the same whether one is dealing with an unfocused aperture or a focused one.

Assuming the physical aperture length of 25.5cm, equation (3.21) will require that in order to eliminate doppler ambiguities,

$f_{prf} > 61176$ Hz.  (3.42)

However,  since  the  maximum  unambiguous  swath  width  is 


We can now see that it is possible to obtain more time-bandwidth samples by simply increasing the number of antenna elements. If we had N elements, the total number of time-bandwidth samples would increase by a factor of N to NBT.
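A minimal sketch of this scaling follows, taking B = 1.67GHz and T = 3.9s from the focused-SAR figures above. The function name is hypothetical and the aperture counts in the loop are chosen only for illustration.

```python
def time_bandwidth_samples(bandwidth_hz: float, obs_time_s: float, n_elements: int = 1) -> float:
    """Number of time-bandwidth samples with n_elements receive apertures (N * B * T)."""
    if n_elements < 1:
        raise ValueError("n_elements must be at least 1")
    return n_elements * bandwidth_hz * obs_time_s


if __name__ == "__main__":
    B = 1.67e9   # bandwidth in Hz, from the focused-SAR example
    T = 3.9      # observation time in s, from the focused-SAR example
    for n in (1, 3, 9):
        print(f"N = {n}: {time_bandwidth_samples(B, T, n):.3e} samples")
```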

Let us now specify a new swath width requirement of three times the value found in equations (3.44) and (3.47). That is, we now want the swath width to be

$R_s = 10500$ m.  (3.48)

To eliminate range ambiguity at this new swath width, equation (3.20) will require that the new pulse repetition frequency be reduced by a factor of three. Specifically, we now require

$f_{prf} < 20203$ Hz.  (3.49)

Obviously, this value of pulse repetition frequency does not agree with that defined earlier in equation (3.42), and will cause ambiguity in the doppler direction unless the antenna length is altered. Now, to satisfy equation (3.49), we require that the new antenna length be

$l = 25.5\ \mathrm{cm} \times 3 = 76.5$ cm.  (3.50)

Finally, returning to equation (3.37), this new aperture length would result in degradation of the along-track pixel resolution. Now,

$\Delta x = 0.1275\ \mathrm{m} \times 3 = 0.3825$ m.  (3.51)

It  is  clear  that  we  have  now  traded  swath  width  for 
resolution.  This  is  the  current  problem  we  face. 
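The trade between swath width and resolution can be reproduced with a short Python sketch. It assumes a doppler bound of the form f_prf > 2v/L, a platform speed of 7800 m/s, and an incidence angle of 45 degrees. None of these is stated in this passage; they are chosen because they reproduce the quoted bounds of 61176 Hz and 20203 Hz. All function names are illustrative.

```python
import math

C = 3.0e8                    # speed of light, m/s (assumed rounded value)
V = 7800.0                   # platform speed, m/s; assumed, chosen to reproduce (3.42)
THETA = math.radians(45.0)   # incidence angle; assumed


def doppler_prf_min(aperture_len_m: float) -> float:
    """Lower PRF bound (Hz) for no doppler ambiguity, taken here as 2 * v / L."""
    return 2.0 * V / aperture_len_m


def max_unambiguous_swath(prf_hz: float) -> float:
    """Widest ground swath (m) free of range ambiguity at the given PRF."""
    return C / (2.0 * prf_hz * math.sin(THETA))


def focused_resolution(aperture_len_m: float) -> float:
    """Along-track resolution (m) of a fully focused SAR: half the aperture length."""
    return aperture_len_m / 2.0


if __name__ == "__main__":
    for scale in (1, 3):
        length = 0.255 * scale
        prf = doppler_prf_min(length)
        print(f"L = {length:.3f} m   PRF > {prf:.0f} Hz   "
              f"swath < {max_unambiguous_swath(prf):.0f} m   "
              f"dx = {focused_resolution(length):.4f} m")
```

Tripling the aperture length triples the unambiguous swath but also triples the along-track pixel size. The swath values printed by this sketch fall slightly below the rounded figures used in the text.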

If, however, we instead implemented the receive antenna as three individual apertures each of length 25.5cm, then the number of time-bandwidth samples increases threefold to 3BT instead of just BT. More importantly, we can see from equations (3.37) and (3.43) that we can now achieve the enlarged swath width of 10500m whilst maintaining the original, superior pixel resolution of 0.1275m, because we do not alter the physical length of each aperture. Rather, we have employed multiple receive apertures to provide this increase in overall antenna length.

Because the transmit antenna aperture occupies the same location as the center receive antenna aperture, the illumination patterns in the along-track direction of the single transmit aperture and the three receive apertures are identical. In addition, each pattern is three times as broad as that of the enlarged aperture of length 76.5cm. With this broader beamwidth of each subaperture, a single target can be observed long enough to achieve a fine along-track resolution. At the same time, this beamwidth is narrow enough to attenuate ambiguities in the along-track direction.

From this, it is evident that the use of multiple receive apertures has now allowed us to potentially obtain wide swath, high resolution, and unambiguous radar images. It is now appropriate to move to a computer simulation environment and put our proposed theoretical solution to a practical test in order to determine how effective this design is.




3.3  Test  Matrix 


Following  the  completion  of  the  parameter  design,  a  test 
matrix  was  developed  to  list  the  situations  to  be  addressed.  This 
test  matrix  is  listed  as  Table  3.1  and  shows  the  different  input 
parameters  necessary  to  attain  a  particular  test  environment. 


Situation   f_prf (KHz)   L_x (cm)      L_y (cm)      Apertures
    1           7         25.5          37.1              1
    2           7         25.5 * 9      37.1              1
    3          60         25.5          37.1              1
    4          60         25.5          37.1 * 9          1
    5          20.5       25.5          37.1              1
    6          20.5       25.5 * 3      37.1              1
    7          20.5       25.5          37.1 * 3          1
    8          20.5       25.5 * 3      37.1 * 3          1
    9          20.5       25.5 by 3     37.1              3
   10          20.5       25.5          37.1 by 3         3
   11           7         25.5 by 3     37.1 by 3         9
   12           7         25.5 by 9     37.1              9
   13          60         25.5 by 3     37.1 by 3         9
   14          60         25.5          37.1 by 9         9
   15          20.5       25.5 by 3     37.1 by 3         9

Table 3.1: Test Matrix for Simulation

As is evident, the first eight situations in this test matrix reflect the problems present in a single receive aperture SAR environment. The pulse repetition frequency and aperture size will be varied to test a variety of situations with the intention of showing that, whilst it is possible to obtain an image that has high resolution, wide swath, or low ambiguity, it is not presently possible to have all these traits in a single image.

In the single receive aperture SAR environment, the physical length and width of the receive aperture are altered to reflect the particular test situation. For example, in Situation 8, the physical aperture length and width are each three times as large because the pulse repetition frequency in that instance differs from each extreme by a factor of about three. Each test situation is analyzed further in the following section.

The  final  seven  situations  will  represent  the  transition  to 
a  considerably  more  efficient  multiple  receive  aperture  SAR 
environment.  Here,  the  pulse  repetition  frequency  will  be  varied 
together  with  the  number  of  apertures  with  the  intention  of 
obtaining  a  single  image  that  possesses  wide  swath,  high 
resolution  and  low  ambiguity. 

In  the  multiple  receive  aperture  SAR  environment,  we  notice 
that  there  is  no  longer  a  physical  change  in  the  size  of  each 
aperture.  All  receive  apertures  (as  well  as  the  single  transmit 
aperture)  are  of  the  same  size.  In  the  multiple  receive  aperture 
SAR  environment,  several  of  these  receive  apertures  are  placed  in 
a  suitable  orientation  depending  on  the  test  situation.  For 
example,  in  Situation  12,  there  are  nine  receive  apertures  placed 
in  the  along-track  direction  due  to  the  presence  of  a  pulse 
repetition frequency that only eliminates ambiguity in the across-track direction. Once again, further analysis of each situation will be provided in the following section.

The pulse repetition frequency ($f_{prf}$) is varied between three values. A value of 60KHz reflects a situation in which doppler ambiguities are eliminated. A value of 7KHz reflects the situation in which range ambiguities are eliminated. A value of 20.5KHz was also chosen because it differs from each of the other two test frequencies by a factor of about three. It represents a position of "compromise" between the complete elimination of range and doppler ambiguities in the test environment.
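One way to see why 20.5KHz sits roughly a factor of three from both extremes (an observation added here, not a statement from the original derivation) is that it is close to the geometric mean of the two limiting test frequencies:

```latex
\sqrt{7\ \mathrm{KHz} \times 60\ \mathrm{KHz}} \approx 20.5\ \mathrm{KHz},
\qquad
\frac{20.5}{7} \approx 2.93,
\qquad
\frac{60}{20.5} \approx 2.93 .
```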

3.4 Test Image

Figure 3.1 shows the test image used for the simulation. This picture has (for decades) been an extremely popular choice of input in image processing research. It was first used by researchers at the University of Southern California in the 1970s and has since become a de facto standard for performance comparisons. The image consists of 65536 pixels arranged in a 256 by 256 grid. The intensity of each pixel ranges from zero (pure black) to 255 (pure white). The along-track direction is equivalent to viewing this image from west to east (or left to right) and the across-track direction is equivalent to viewing this image from north to south (or top to bottom).
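The indexing convention described above can be sketched in Python as follows. The 105m pixel length is carried over from the earlier parameter design on the assumption that it applies to this image (256 pixels of 105m give the 26.88km side length quoted there). The function names and the blank placeholder scene are hypothetical and stand in for the actual test picture.

```python
SIZE = 256          # pixels per side, as stated for the test image
PIXEL_M = 105.0     # pixel length in metres; assumed from the earlier parameter design


def make_blank_scene(size: int = SIZE) -> list[list[int]]:
    """Return a size-by-size grid of intensities, all zero (pure black).

    scene[row][col]: row runs across-track (top to bottom),
    col runs along-track (left to right).
    """
    return [[0 for _ in range(size)] for _ in range(size)]


def ground_position(row: int, col: int, pixel_m: float = PIXEL_M) -> tuple[float, float]:
    """Map a pixel index to (along_track_m, across_track_m) measured from the top-left corner."""
    return (col * pixel_m, row * pixel_m)


if __name__ == "__main__":
    scene = make_blank_scene()
    scene[0][SIZE - 1] = 255   # one pure-white pixel at the top-right corner
    print("pixels:", len(scene) * len(scene[0]))
    print("side length (km):", SIZE * PIXEL_M / 1000.0)
```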


[Figure 3.1 image not reproduced; axes: across-track (vertical), along-track (horizontal, arrow pointing right)]

Figure 3.1: Test Image used as Target Area


4 RESULTS

4.1 Situation 1

In  this  situation,  the  pulse  repetition  frequency  is  set  at 
7KHz  whilst  the  single  receive  aperture  has  a  length  of  25.5cm  in 
the  along-track  direction  and  a  width  of  37.1cm  in  the  across- 
track  direction.  The  receiver  response  is  shown  as  Figure  4.1. 
The  pulse  repetition  frequency  setting  is  sufficient  to  eliminate 
range  ambiguities,  but  not  doppler  ambiguities.  The  aperture  size 
settings  result  in  a  wide  swath  that  illuminates  the  entire  input 
image  area.  However,  the  broad  illumination  beamwidth  now  detects 
points  of  ambiguity  in  the  doppler  direction.  This  is  the  reason 
why  we  observe  repetitions  of  the  input  image  in  the  along-track 
direction.  The  result  is  an  image  that  possesses  wide  swath  but 
poor  resolution  and  high  ambiguity  in  the  along-track  direction. 


Figure 4.1: Radar Receiver Response for Situation 1

4.2 Situation 2


The  poor  resolution  observed  in  Figure  4.1  is  a  result  of 
the  wide  beamwidth  illuminating  areas  of  ambiguity  in  doppler.  A 
possible  solution  would  be  to  alter  this  beamwidth  with  the 
intention of improving resolution and lowering ambiguity. Thus, in Situation 2, the pulse repetition frequency is maintained at 7KHz. However, to counter the effects of ambiguity in the along-track direction, the along-track aperture length of the single receiver is made nine times larger than the original value of 25.5cm. The across-track aperture width remains at 37.1cm. This
translates  into  a  narrow  illumination  area  in  the  along-track 
direction.  In  the  results  shown  in  Figure  4.2,  we  observe  a  thin 
illumination  area  in  the  doppler  direction  that  is  now  able  to 
eliminate  the  ambiguities  present  in  the  along-track  direction. 
The  result  is  an  unambiguous  image  with  high  resolution  but  now 
with  a  narrow  along-track  illumination  width. 


Figure 4.2: Radar Receiver Response for Situation 2

4.3 Situation 3


Figure 4.3 shows the radar receiver response for Situation 3. In this situation, the pulse repetition frequency is set at 60KHz. This satisfies the requirement for non-ambiguity in the along-track direction, but does not eliminate ambiguity in the across-track direction. The single receiver aperture size is once again set at 25.5cm in the along-track direction and 37.1cm in the across-track direction. This will result in a wide swath width, but our concern here is that this wide swath will now detect ambiguity in the across-track direction. As expected, Figure 4.3 now shows repetitions of the input image area in the across-track direction. The result here is an image with wide swath but poor resolution and high ambiguity in the across-track direction.


Figure 4.3: Radar Receiver Response for Situation 3

4.4 Situation 4


We observe once again that the poor resolution in Figure 4.3 is a result of the wide swath width detecting points of ambiguity in range. A possible solution to improve resolution would be to narrow the swath width. Therefore, in this situation the single receive aperture has an along-track length of 25.5cm but an across-track width of nine times the original value of 37.1cm. The pulse repetition frequency is maintained at 60KHz. Figure 4.4 shows the results obtained for this situation. We observe that there is now a narrow strip in the across-track direction that is able to eliminate all the ambiguities in the direction of range. However, this has come at the expense of narrow coverage in the across-track direction. The result we obtain here is an unambiguous image that has high resolution but now with a narrow swath width.


Figure  4.4:  Radar  Receiver  Response  for  Situation  4 


4.5 Situation 5

Next, the pulse repetition frequency was set at 20.5KHz, a factor of about three away from both 7KHz and 60KHz. The single receive aperture is again set at its original size of 25.5cm in the along-track direction and 37.1cm in the across-track direction. With this aperture size, we expect the entire image area to be illuminated. Figure 4.5 shows the results obtained in this situation. With the pulse repetition frequency a factor of three away from either extreme, we expect one third of the ambiguity in both the range and doppler directions. In Figure 4.5, we are able to notice more of the original image than we did in Figures 4.1 and 4.3. We are also able to observe the distinct ambiguous positions in either direction. Thus this situation returns an image with poor resolution and moderate ambiguity.


Figure  4.5:  Radar  Receiver  Response  for  Situation  5 




4.6 Situation 6


In the next test situation, the pulse repetition frequency was maintained at the intermediate value of 20.5KHz. This would again lead us to expect our resultant image to have less ambiguity in both directions. However, in this instance, we set the along-track aperture length to three times its original value of 25.5cm. The across-track aperture width remains at 37.1cm. From this, we would expect a slightly wider illumination beamwidth than we saw in Figure 4.2. Figure 4.6 shows the results obtained. We see that by increasing the along-track aperture length, we have eliminated a considerable amount of ambiguity. However, ambiguity remains in the across-track direction because a pulse repetition frequency of 20.5KHz is still too high to eliminate range ambiguities.


Figure 4.6: Radar Receiver Response for Situation 6

4.7 Situation 7


In Situation 7, it is the across-track aperture width that is now set to three times its original value of 37.1cm. The along-track aperture length returns to its original value of 25.5cm. Therefore, we expect a swath width that is slightly wider than that of Figure 4.4. We maintain the pulse repetition frequency at the intermediate value of 20.5KHz. Once again, this leads us to expect a factor of three reduction in ambiguity. Figure 4.7 shows the results obtained for this situation. As expected, we have eliminated all the ambiguity in the across-track direction by enlarging the across-track aperture width. However, a pulse repetition frequency of 20.5KHz is not high enough to eliminate all the along-track ambiguity within the illumination area. We are left with an ambiguous image that has limited swath coverage.


Figure 4.7: Radar Receiver Response for Situation 7

4.8 Situation 8


In the final single receive aperture test situation, we aim for a compromise between ambiguity, resolution and swath width. Here, we maintain the pulse repetition frequency at the intermediate level of 20.5KHz. We set the along-track aperture length to three times its original value of 25.5cm and the across-track aperture width to three times its original value of 37.1cm. The intermediate pulse repetition frequency will reduce the ambiguity by a factor of three in each direction. In addition, the larger aperture dimensions will reduce ambiguity by a further factor of three in both directions. Thus, in this instance we would expect an unambiguous image. Figure 4.8 shows the results obtained in this situation. We observe that our prediction of an unambiguous image was correct. However, this has now come at the expense of a narrow swath width (in range) and a narrow illumination beamwidth (in doppler).


Figure  4.8:  Radar  Receiver  Response  for  Situation  8 


4.9  Situation  9 


This situation marks the transition to a multiple receive aperture SAR environment. From this situation onward, all receive apertures have a fixed length of 25.5cm in the along-track direction and a fixed width of 37.1cm in the across-track direction. Here, we design three receive apertures lined up in the along-track direction. The pulse repetition frequency is maintained at the intermediate value of 20.5KHz. The result for this situation is shown as Figure 4.9. We see a marked improvement in performance as compared to Figure 4.6. The additional apertures in the along-track direction have allowed for a wider illumination beamwidth in Figure 4.9 without any further increase in ambiguity from Figure 4.6. This was previously unachievable in the single receive aperture environment. However, whilst this is an improvement over the single aperture environment, the remaining across-track ambiguity is clearly undesirable.


Figure 4.9: Radar Receiver Response for Situation 9

4.10 Situation 10


In this situation, the pulse repetition frequency is maintained at 20.5KHz. The three receive aperture design is also maintained, but these apertures are now laid out in the across-track direction. The result for this situation is shown as Figure 4.10. Here, we also notice a marked improvement in performance as compared to Figure 4.7. Whilst the ambiguity in the along-track direction remains the same, the three receive apertures in the across-track direction have allowed for a wider swath width in the image. This, once again, was an unachievable goal in the single receive aperture environment. However, the remaining ambiguity is undesirable and clearly indicates that three apertures are insufficient to return an unambiguous image.


Figure 4.10: Radar Receiver Response for Situation 10

4.11 Situation 11

This  situation  marks  the  beginning  of  the  nine  receive 
aperture  design  phase.  Here,  we  start  once  again  by  returning  to 
a  pulse  repetition  frequency  of  7KHz  that  will  eliminate  range 
ambiguities  but  not  ambiguities  in  the  doppler  direction.  We  also 
design the multiple receive aperture unit such that there are three apertures in the across-track direction and three in the along-track direction. With three apertures in the along-track direction, we expect the ambiguity in that direction to be reduced by a factor of three. Figure 4.11 shows the results achieved for this
situation.  This  result  is  a  significant  improvement  over  Figure 
4.1.  However,  because  the  apertures  are  not  oriented  in  the  right 
manner,  there  remains  ambiguity  in  the  along-track  direction. 
With  a  low  pulse  repetition  frequency,  all  ambiguity  in  the 
across-track  direction  is  eliminated  and  it  is  pointless  to  place 
additional  apertures  in  the  direction  of  range. 


Figure 4.11: Radar Receiver Response for Situation 11

4.12 Situation 12


In this situation, we work to improve the result achieved in Figure 4.11. From the results achieved in Situation 11, it is clear that with a pulse repetition frequency of 7KHz, there is no need for additional apertures in the across-track direction. A more appropriate design would be to maintain the nine aperture design but orient the apertures so that all nine lie in the along-track direction. With a low pulse repetition frequency, the along-track direction is where ambiguity exists and, thus, where it needs to be removed. Figure 4.12 shows the results obtained in this situation. Finally, we have arrived at a satisfying result. We now observe an unambiguous image that possesses high resolution and wide swath. A multiple receive aperture SAR environment coupled with proper orientation of apertures has allowed us to achieve this result.


Figure 4.12: Radar Receiver Response for Situation 12

4.13 Situation 13


We now refocus our attention on a pulse repetition frequency of 60KHz. This value will eliminate ambiguity in the along-track direction but not in the across-track direction. We return to our nine aperture receiver design with three apertures aligned in the across-track direction and three aligned in the along-track direction. From our results obtained in Situation 11, we can predict that this may not be the best receiver orientation to give us optimum performance. Figure 4.13 shows the result obtained for this situation. Although the multiple receivers have provided us with a wide swath, there still exists ambiguity in the across-track direction due to incorrect receiver orientation. It seems pointless here to add receivers in the along-track direction when our choice of pulse repetition frequency will already ensure that no ambiguity is present in the doppler direction. The three receive apertures in the across-track direction are only sufficient to reduce ambiguity by a factor of three.


Figure 4.13: Radar Receiver Response for Situation 13

4.14 Situation 14


We now attempt to improve upon the results achieved in Figure 4.13. It is clear that our receiver design in Situation 13 was not appropriate. A choice of 60KHz as the pulse repetition frequency effectively eliminates doppler ambiguities, so there is no need to place additional receivers in the along-track direction. Rather, the nine receivers should all be placed in the across-track direction. This would provide the desired ambiguity reduction factor of nine in the across-track direction. Figure 4.14 shows another extremely satisfying result. We have once again managed to obtain an unambiguous image that possesses high resolution and wide swath width. The result obtained in Figure 4.14 is identical to that obtained in Figure 4.12. This observation will be useful once the results for the following section are obtained.


Figure 4.14: Radar Receiver Response for Situation 14

4.15 Situation 15


Finally, we return to a pulse repetition frequency of 20.5KHz. With experience from previous situations, we know that this choice of pulse repetition frequency reduces ambiguity in both along-track and across-track directions by a factor of three. We are now able to predict that our multiple receive aperture design must reduce the ambiguity by a further factor of three in each direction before we are able to obtain an unambiguous image. Thus, there must be three apertures in the across-track direction and three apertures in the along-track direction of the receiver. The results for this situation are shown as Figure 4.15. We have an unambiguous image that has high resolution and wide swath. Also, whilst they were all generated with different pulse repetition frequencies, Figures 4.12, 4.14, and 4.15 are identical. The multiple receive aperture SAR environment has managed to remove the dependence of its receiver response on the pulse repetition frequency.
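The outcomes reported for the fifteen situations follow a simple bookkeeping rule, sketched below in Python. The `REQUIRED` table (the reduction factor still needed in each direction at each test frequency) and the two predicates are a simplification introduced here for illustration. They summarize the reasoning in the text and are not the simulator itself; all names are hypothetical.

```python
from dataclasses import dataclass

# Ambiguity-reduction factor still needed in (along-track, across-track) at each
# test PRF in KHz, as read from the discussion of the situations.
REQUIRED = {7.0: (9, 1), 20.5: (3, 3), 60.0: (1, 9)}


@dataclass(frozen=True)
class Situation:
    number: int
    prf_khz: float
    along: int       # along-track enlargement factor or aperture count
    across: int      # across-track enlargement factor or aperture count
    apertures: int   # 1 = single (possibly enlarged) aperture, >1 = multiple apertures


TEST_MATRIX = [
    Situation(1, 7.0, 1, 1, 1), Situation(2, 7.0, 9, 1, 1),
    Situation(3, 60.0, 1, 1, 1), Situation(4, 60.0, 1, 9, 1),
    Situation(5, 20.5, 1, 1, 1), Situation(6, 20.5, 3, 1, 1),
    Situation(7, 20.5, 1, 3, 1), Situation(8, 20.5, 3, 3, 1),
    Situation(9, 20.5, 3, 1, 3), Situation(10, 20.5, 1, 3, 3),
    Situation(11, 7.0, 3, 3, 9), Situation(12, 7.0, 9, 1, 9),
    Situation(13, 60.0, 3, 3, 9), Situation(14, 60.0, 1, 9, 9),
    Situation(15, 20.5, 3, 3, 9),
]


def is_unambiguous(s: Situation) -> bool:
    """True when the aperture layout supplies the reduction needed in both directions."""
    need_along, need_across = REQUIRED[s.prf_khz]
    return s.along >= need_along and s.across >= need_across


def keeps_wide_coverage(s: Situation) -> bool:
    """An enlarged single aperture narrows its beam; original-size apertures do not."""
    return s.apertures > 1 or (s.along == 1 and s.across == 1)


if __name__ == "__main__":
    clean = [s.number for s in TEST_MATRIX if is_unambiguous(s)]
    best = [s.number for s in TEST_MATRIX if is_unambiguous(s) and keeps_wide_coverage(s)]
    print("unambiguous:", clean)
    print("unambiguous with wide coverage:", best)
```

Under this rule only Situations 12, 14, and 15 are both unambiguous and wide in coverage, which matches the three figures reported as identical.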


Figure 4.15: Radar Receiver Response for Situation 15

5 CONCLUSION


This project has documented in detail the performance benefits of a multiple receiver SAR in contrast to a single receiver one. In particular, it has been shown that the single receiver SAR environment constantly faces a tradeoff involving ambiguity, resolution, and swath width. A high pulse repetition frequency is necessary in order to eliminate ambiguity in the along-track direction. However, we also require a low pulse repetition frequency to eliminate ambiguity in the across-track direction. This ambiguity may be compensated for in either direction by reducing the illumination area. However, this in turn results in a narrow swath width. As a result, it is impossible (in the single receiver SAR environment) to completely eliminate ambiguity in both the along-track and across-track directions whilst maintaining wide swath width and high resolution.

The  multiple  receiver  SAR  environment  has  alleviated  this 
problem  because  the  restriction  imposed  by  the  time-bandwidth 
product  is  now  lifted.  We  are  now  able  to  obtain  a  sufficient 
number  of  independent  samples  in  order  to  generate  an  unambiguous 
result. This model removes the dependence of the receiver response on the pulse repetition frequency. An unambiguous, wide swath, high resolution image may now be generated independent of the choice of pulse repetition frequency.

A  SAR  simulator  was  built  to  generate  test  images  that 
reflect  performance  in  a  single  and  multiple  receiver  test 
environment.  It  was  proven  that  when  multiple  receivers  are 
employed  in  a  SAR,  it  is  possible  to  generate  a  high  resolution, 
wide  swath,  and  unambiguous  SAR  image. 

6 RECOMMENDATIONS AND FUTURE WORK

Moving target indication (MTI) is another area of SAR data processing where there is currently a considerable amount of ongoing research. In conventional SAR imaging, it is the ground, and any stationary object on it, that is the target. However, the doppler shift that a moving target generates competes with the doppler shift generated by the SAR's own motion. It is this that makes moving targets difficult to resolve in SAR imaging. Whilst new methods such as reflectivity displacement [10] and radial velocity band filtering [11] have been offered as possible partial solutions to this problem, there undoubtedly remains much work to be done in the area of SAR moving target indication.

In  the  course  of  software  development  for  the  multiple 
receive  element  SAR  simulator,  it  was  ensured  that  functionality 
for  future  testing  of  this  model  on  moving  targets  was  also 
included.  In  essence,  a  velocity  vector  that  defines  each  point 
in  the  imaged  area  has  been  included  as  part  of  the  model.  As 
this  project  focused  on  obtaining  wide  swath,  high  resolution, 
unambiguous  images,  this  vector  was  simply  defined  as  null  for 
present  purposes.  Furthermore,  this  velocity  vector  has  been 
designed  with  three  dimensional  coordinates.  This  implies  that 
the  model  is  suitable  for  research  into  both  ground  moving  target 
indication (GMTI) and airborne moving target indication (AMTI).
With straightforward modifications and the simple introduction of
a ground or airborne velocity, the multiple receive element SAR
simulator constructed may now easily be used in a variety of
situations for future research on SAR-MTI.
DEVELOPMENT OF ALGORITHMS FOR LINEAR AND NONLINEAR
RETRIEVAL  PROBLEMS  IN  ATMOSPHERIC  REMOTE  SENSING 


Miguel  Velez-Reyes 
Associate  Professor 

Electrical  and  Computer  Engineering  Department 
University  of  Puerto  Rico  Mayaguez  Campus 

Abstract 

Infrared  and  microwave  passive  remote  sensing  of  the  atmosphere  is  used  to  determine  the  earth 
atmospheric  state  and  surface  properties.  Radiance  measured  by  the  radiometers  can  be  used  to  estimate  atmospheric 
parameters  such  as  temperature  and  water  vapor  content.  These  quantities  are  of  primary  importance  for  applications 
in  meteorology,  oceanography,  and  geophysical  sciences.  The  retrieval  or  estimation  of  vertical  profiles  of 
temperature and trace-gas concentrations is called atmospheric sounding. Depending on the frequency range in the
electromagnetic  spectrum  being  measured  by  the  radiometer  and  the  atmospheric  quantities  to  be  estimated,  the 
retrieval  or  inverse  problem  of  determining  atmospheric  parameters  from  radiance  measurements  might  be  linear  or 
nonlinear.  In  many  applications,  the  retrieval  problem  requires  the  inversion  of  a  Fredholm  integral  equation  of  the 
first  kind,  making  this  an  ill-posed  problem.  The  numerical  solution  of  the  retrieval  problem  requires  the 
transformation of the continuous problem into a discrete problem. The ill-posedness of the continuous problem
translates  into  ill-conditioning  or  ill-posedness  of  the  discrete  problem.  Another  difficulty  associated  with  the 
retrieval  problems  is  that  they  are  underconstrained  because  only  a  few  radiance  intensities  are  being  measured  by 
the sensor while the quantities of interest are estimated at a higher vertical resolution (more unknowns). To further
constrain the problem and deal with the ill-posedness, prior information is incorporated into the problem to convert
the  ill-posed  problem  into  a  well-posed  one. 

In  this  project,  we  developed  an  algorithm  for  nonlinear  retrieval  problems  in  atmospheric  remote  sensing 
based  on  the  Gauss-Newton  method  to  solve  nonlinear  least  square  problems.  Regularization  was  applied  to  the 
underdetermined  linear  least-squares  problem  of  computing  the  search  direction  using  Tikhonov  and  TSVD 
regularization  methods  for  linear  inverse  problems.  The  method  was  applied  to  the  nonlinear  problem  of 
atmospheric temperature retrieval from HIRS/2 and MSU radiometry. A set of MATLAB macros implementing the
algorithms  was  developed.  Simulation  studies  were  performed  to  understand  the  performance  of  the  proposed 
method.  Three  key  factors  affect  the  performance  of  the  retrieved  profile:  (1)  the  initial  guess,  (2)  the  updating 
criterion  for  the  regularization  parameter,  and  (3)  the  stopping  criterion.  The  implementations  of  the  retrieval 
algorithm  produced  similar  results  for  both  Tikhonov  and  TSVD  regularization.  The  root  mean  square  (RMS)  error 
for the estimated temperature profiles varied from 0.9964 K to 4.0728 K for the noiseless case.


1.  Introduction 

Infrared and microwave passive remote sensing of the atmosphere is used to determine the earth's
atmospheric state and surface properties. Radiance measured by the radiometers can be used to estimate atmospheric
parameters such as temperature and water vapor content. These quantities are of primary importance for applications
in meteorology, oceanography, and geophysical sciences. The retrieval or estimation of vertical profiles of
temperature and trace-gas concentrations is called atmospheric sounding.

Early satellite vertical sounding programs were justified on the basis of their potential for improving
large-scale numerical weather forecasts for periods greater than one day over continents, as a result of improved
observation over oceans. More recently, it has been recognized that satellite soundings can also play a key role in
supplying the observations needed for small-scale intense weather prediction. Vertical soundings achieved from
satellites have a positive impact on numerical weather prediction (NWP) analyses and forecasts in the Southern
Hemisphere. It was also demonstrated that they had, on average, a positive impact when used in data-sparse areas of
the Northern Hemisphere (Eyre et al., 1993).

Depending on the frequency range in the electromagnetic spectrum being measured by the radiometer and
the atmospheric quantities to be estimated, the retrieval or inverse problem of determining atmospheric parameters
from radiance measurements might be linear or nonlinear. In many applications, the retrieval problem requires the
inversion of a Fredholm integral equation of the first kind, making this an ill-posed problem (Hansen, 1998). The
numerical solution of the retrieval problem requires the transformation of the continuous problem into a discrete
problem. The ill-posedness of the continuous problem translates into ill-conditioning or ill-posedness of the discrete
problem. Another difficulty associated with the retrieval problems is that they are underconstrained because only a
few radiance intensities are being measured by the sensor while the quantities of interest are estimated at a higher
vertical resolution (more unknowns). To further constrain the problem and deal with the ill-posedness, prior
information is incorporated into the problem to convert the ill-posed problem into a well-posed one (Hansen, 1998).

This report presents the work supported by AFOSR 1996 Summer Research Extension Program
Subcontract 97-0868. This research project develops and validates algorithms for nonlinear retrievals. Simulation
results are presented for atmospheric temperature retrievals based on radiometry from the High Resolution Infrared
Radiation Sounder/2 (HIRS/2) and the Microwave Sounding Unit (MSU) in the TIROS Operational Vertical
Sounder (TOVS). Algorithms are based on the Gauss-Newton (GN) method for nonlinear least squares, and the
Tikhonov and Truncated Singular Value Decomposition (TSVD) regularization methods for linear inverse problems
described by Hansen (1996) and Eriksson (1996). In our work, we used the Goddard Laboratory for Atmospheres
(GLA) TOVS radiative transfer code provided by the Goddard Data Assimilation Office (Sienkiewicz, 1996). A set
of MATLAB® functions has been developed for the retrieval algorithms. The algorithms are validated by means of
simulations.


2.  Radiative  Transfer  Theory  for  Forward  Radiance 


Radiative  transfer  theory  describes  the  intensity  of  radiation  propagating  in  a  general  class  of  media  that 
absorbs,  emits,  transmits,  and  scatters  the  radiation  (Lenoble,  1993).  The  general  form  of  the  radiative  transfer 
equation  (RTE)  for  a  plane-parallel  atmosphere  is  given  by 


$$\frac{dR_\nu(\theta)}{dp} = -\sigma(p)\,R_\nu(\theta) + S_\nu(\theta) \qquad (1)$$


where $R_\nu(\theta)$ is the radiance that flows at each point in the medium per unit area, per unit of solid angle, at a given
frequency $\nu$ and zenith angle $\theta$; $\sigma(p)$ is the extinction coefficient at pressure level $p$; and $S_\nu(\theta)$ is a source
term at zenith angle $\theta$. These last two quantities describe the loss/gain of radiant intensity into the given direction.

In the general case, scattering into and from other directions can lead to both gains and losses in the
intensity, which are accounted for by the terms $\sigma(p)$ and $S_\nu(\theta)$. For the infrared and microwave regions of the
spectrum, the scattering term is usually neglected (Janssen, 1993). If scattering is neglected, the only source term to
consider is that due to local emission, and the extinction coefficient reduces to the absorption coefficient $\sigma_\nu(p)$.
The emission follows Kirchhoff's law at the local temperature, and the source term reduces to

$$S_\nu(\theta) = \sigma_\nu(p)\,B_\nu(T(p))$$

where $B_\nu(T)$ is the Planck function:

$$B_\nu(T) = \frac{2h\nu^3}{c^2}\,\frac{1}{e^{h\nu/kT} - 1} \qquad (2)$$

and $h = 6.626 \times 10^{-34}$ J s is Planck's constant, $c = 2.998 \times 10^{8}$ m/s is the speed of light, and
$k = 1.381 \times 10^{-23}$ J/K is Boltzmann's constant.


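Equation (1) then becomes

$$\frac{dR_\nu(\theta)}{dp} = -\sigma_\nu(p)\left[R_\nu(\theta) - B_\nu(T(p))\right]$$

This is a linear non-homogeneous first-order differential equation with solution

$$R_\nu(\theta) = R_\nu(p_s,\theta)\,\tau_\nu(p_s,\theta) + \int_{p_s}^{0} B_\nu[T(p)]\,\frac{\partial \tau_\nu(p,\theta)}{\partial p}\,dp + R_\nu'(\theta) \qquad (3)$$

where $p_s$ is the surface pressure and $R_\nu(p_s,\theta)$ is the surface boundary condition. The term $\tau_\nu(p,\theta)$ is the
transmittance from pressure $p$ to the top of the atmosphere at zenith angle $\theta$, defined as

$$\tau_\nu(p,\theta) = \exp\left[-\int_{0}^{p} \sigma_\nu(P)\,\sec(\theta)\,dP\right]$$

and $R_\nu'(\theta)$ is the contribution from radiance reflected from the surface. Equation (3) is the basic radiative transfer
equation for radiation emitted at the top of the atmosphere at a single frequency $\nu$.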

Given a radiance observation $R_\nu$, it is convenient to think in terms of the quantity $T_b$, the brightness
temperature of the observation. This is the temperature a blackbody would have if it gave off the same radiance at
frequency $\nu$. There is a unique relationship between $R_\nu$ and $T_b$ given by

$$T_b = B_\nu^{-1}(R_\nu) = \frac{h\nu}{k}\left[\ln\left(\frac{2h\nu^3}{c^2 R_\nu} + 1\right)\right]^{-1} \qquad (4)$$

where $R_\nu$ is given by (3) (Susskind et al., 1984). In the microwave region of the spectrum,

$$h\nu \ll kT$$

which results in (2) taking the form

$$B_\nu(T) \approx \frac{2\nu^2 kT}{c^2} = \frac{2kT}{\lambda^2}$$

where $\lambda$ is wavelength. This is known as the Rayleigh-Jeans approximation (Lenoble, 1993). From this expression it
is clear that in the microwave region, the energy emitted is proportional to the physical temperature $T$. In terms of
brightness temperature $T_b$, (3) takes the form

$$T_b = T_b(p_s,\theta)\,\tau_\nu(p_s,\theta) + \int_{p_s}^{0} T(p)\,\frac{\partial \tau_\nu(p,\theta)}{\partial p}\,dp + T_b'(\theta)$$
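The relations among the Planck function (2), its inverse (4), and the Rayleigh-Jeans limit can be checked numerically. The following is a minimal sketch, assuming SI units and standard rounded values of the physical constants; the function names are illustrative and are not part of the GLA TOVS code or the MATLAB functions developed in this work.

```python
import math

H = 6.626e-34  # Planck's constant, J s
C = 2.998e8    # speed of light, m/s
K = 1.381e-23  # Boltzmann's constant, J/K


def planck(nu, temp):
    """Planck function B_nu(T) of equation (2); nu in Hz, temp in K."""
    return (2.0 * H * nu**3 / C**2) / math.expm1(H * nu / (K * temp))


def brightness_temperature(nu, radiance):
    """Inverse Planck function of equation (4): radiance -> temperature in K."""
    return (H * nu / K) / math.log1p(2.0 * H * nu**3 / (C**2 * radiance))


def rayleigh_jeans(nu, temp):
    """Rayleigh-Jeans approximation, intended for h*nu << k*T."""
    return 2.0 * nu**2 * K * temp / C**2


# Infrared example (about 15 micrometers): (4) inverts (2).
nu_ir = 2.0e13
t_back = brightness_temperature(nu_ir, planck(nu_ir, 250.0))

# Microwave example (about 50 GHz): radiance is nearly linear in T.
nu_mw = 5.03e10
ratio = planck(nu_mw, 250.0) / rayleigh_jeans(nu_mw, 250.0)
```

In the microwave example the ratio of the exact to the approximate radiance stays within one percent of unity, which is why brightness temperature can stand in for radiance in the microwave channels.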

In  the  context  of  our  experiment  with  HIRS/2  and  MSU  instruments,  each  channel  represents  an 
observation  taken  over  a  range  of  frequencies, 

$$R_i(\theta) = \int \phi_i(\nu)\,R_\nu(\theta)\,d\nu$$

where $\phi_i(\nu)$ is the filter response function for satellite channel $i$ (Sienkiewicz, 1996). Equation (3) can be rewritten
as

$$R_i(\theta) = R_i(p_s,\theta)\,\tau_i(p_s,\theta) + \int_{p_s}^{0} B_i[T(p)]\,\frac{\partial \tau_i(p,\theta)}{\partial p}\,dp + R_i'(\theta)$$

where now $B_i(T) = B_{\nu_i}(T)$ and $R_i'(\theta) = R_{\nu_i}'(\theta)$ are evaluated at a prescribed central channel frequency $\nu_i$, and the
transmittance $\tau_i$ is a modeled transmittance fitted to channel-averaged transmittances obtained from line-by-line
calculations. Although the Planck function $B_\nu(T)$ changes with frequency $\nu$, it is assumed that the width of the
channel response function $\phi_i$ is small enough that the Planck function at the central frequency $\nu_i$ can be used without
modification (Sienkiewicz, 1996).


2.1  Radiative  Transfer  Theory  for  Reflected  Radiance 

Let us now take a look at $R_i'(\theta)$, the contribution from the reflected radiance. Susskind et al. (1984)
define it as

$$R_i'(\theta) = (1 - \varepsilon_i)\,R_i{\downarrow}\,\tau_i(p_s,\theta) + \rho_i H_i\,\tau_i(\theta,\theta_s) \qquad (5)$$

where $\varepsilon_i$ is the surface emissivity for channel $i$, $R_i{\downarrow}$ is the effective downward flux of radiation from
atmospheric emission for channel $i$, $\rho_i$ is the bi-directional reflectance of the surface to the satellite of solar
radiation from the sun with zenith angle $\theta_s$, $H_i$ is the solar radiation at the top of the atmosphere, and
$\tau_i(\theta,\theta_s)$ is the atmospheric transmittance along the entire path of
incident and reflected solar radiation. It is important to notice that the first term in the right-hand side of (5)
represents the contribution of downwelling atmospheric radiation, and the second term represents the contribution of
reflected solar radiation. Hence, these quantities account for the upper boundary conditions.

For the first term $R_i{\downarrow}$, there are several ways to calculate it, depending on the channel it is computed for.
According to Susskind et al. (1984), for HIRS IR channels 8, 10, 18, and 19, where the atmosphere is optically thin,
the effective downward flux $R_i{\downarrow}$ is calculated as:

$$R_i{\downarrow} = 2\cos(\theta)\int_{\tau_i(p_s,\theta)}^{1} B_i(T)\,d\tau \qquad (6)$$

For the other HIRS channels, the effective downward flux $R_i{\downarrow}$ is modeled according to Kornfield and
Susskind (1977):

$$R_i{\downarrow} = F_i\,B_i(T_a)\left(1 - \tau_i(p_s,\theta)\right) \qquad (7)$$

where $T_a$ is the surface air temperature and $F_i$ is a channel-dependent constant.

Finally, for the MSU microwave channels, the downward flux $R_i{\downarrow}$ is calculated as:

$$R_i{\downarrow} = T_{bb}\,\tau_i(p_s,\theta) + \int_{0}^{p_s} T(p)\,\frac{d}{dp}\left[\frac{\tau_i(p_s,\theta)}{\tau_i(p,\theta)}\right]dp \qquad (8)$$

where $T_{bb}$ is the background radiation equal to 2.5 K.


2.2  Representation  in  Finite-Difference  Form 

To numerically solve the equation of radiative transfer, we used the Goddard Laboratory for Atmospheres
(GLA) TOVS rapid algorithm for forward radiance and Jacobian calculation (Sienkiewicz, 1996). The algorithm
consists of a set of Fortran modules for the computation of the forward radiance as stated in (3), and the
corresponding Jacobian (derivative with respect to input parameters).

The approach followed in the modules was to develop a finite-difference forward model representation, and
then perform derivative operations on this model to obtain the Jacobians for temperature. The set of brightness
temperatures $T_b(\nu)$ is also computed by means of (4).

The forward radiance calculation from (3) can be written in finite-difference form as

$$R_i(\theta) = R_i(p_s,\theta)\,\tau_{i,N}(\theta) + \sum_{j=1}^{N} B_i(T_j)\left(\tau_{i,j-1}(\theta) - \tau_{i,j}(\theta)\right) + R_i'(\theta)$$

where the subscript $i$ indicates quantities corresponding to channel $i$, $T_j$ is the mean layer temperature at the
atmospheric level or height $j$, and $N$ is the total number of layers. Note that for microwave channels we use $T_j$
instead of $B_i(T_j)$. In our problem, $i = 1, 2, \ldots, 22$ and $N = 29$ levels.
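The layer sum in the finite-difference forward model can be illustrated with a short sketch. It assumes that the transmittances are supplied as a list running from the satellite (transmittance 1) down to the surface, and that the surface and reflected terms are given as plain numbers; the names and the sample profile are hypothetical and do not reproduce the GLA TOVS Fortran modules.

```python
import math

H, C, K = 6.626e-34, 2.998e8, 1.381e-23  # SI units


def planck(nu, temp):
    """Planck function B_nu(T); nu in Hz, temp in K."""
    return (2.0 * H * nu**3 / C**2) / math.expm1(H * nu / (K * temp))


def forward_radiance(nu, layer_temps, tau, surface_radiance, reflected=0.0):
    """Finite-difference forward radiance for one channel.

    layer_temps[j] is the mean temperature of layer j+1 (N layers).
    tau[0] = 1 at the satellite and tau[j] is the transmittance from
    level j to the satellite, so tau has N + 1 entries.
    """
    if len(tau) != len(layer_temps) + 1:
        raise ValueError("tau needs one more entry than layer_temps")
    # Each layer emits B(T_j) weighted by the transmittance drop across it.
    emission = sum(
        planck(nu, t) * (tau[j] - tau[j + 1])
        for j, t in enumerate(layer_temps)
    )
    return surface_radiance * tau[-1] + emission + reflected


# Sanity check: an isothermal atmosphere over a blackbody surface at the
# same temperature must return exactly the blackbody radiance.
nu = 2.0e13
b250 = planck(nu, 250.0)
radiance = forward_radiance(nu, [250.0, 250.0, 250.0], [1.0, 0.8, 0.5, 0.2], b250)
```

The isothermal case is a convenient test because the transmittance differences telescope to one minus the surface transmittance, so the surface and atmospheric terms add up to a single Planck radiance.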

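Equation (5) for the reflected radiance is expressed in finite-difference form as

$$R_i'(\theta) = (1 - \varepsilon_i)\,R_i{\downarrow}\,\tau_{i,N}(\theta) + \rho_i H_i\,\tau_{i,N}(\theta_{\mathrm{eff}})$$

where the effective solar zenith angle is $\theta_{\mathrm{eff}} = \sec^{-1}\left(\sec(\theta) + \sec(\varphi)\right)$ and $\varphi$ is the solar zenith angle. For
channels where the atmosphere is optically thin (HIRS IR channels 8, 10, 18, and 19), (6) is approximated as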

N 

Rii  =  2 cosO^ 5, (7i)(T,  -  r,  ,(^)) 

/=! 

Similarly, for the other IR channels (7) is approximated as

$$R_i{\downarrow} = F_i\,B_i(T_a)\left(1 - \tau_{i,N}(\theta)\right)$$

where $T_a$ is the surface air temperature. For the microwave channels, the finite-difference representation of (8) is


given  as 


-  1 


R,i  =  +  — 

i=\ 


The  total  transmittance  between  satellite  level  and  a  level  /,  is  the  product  of  effective  layer  transmittances 
for  all  layers  between  satellite  level  and  the  level  /.  These  effective  layer  transmittances  are  modeled  as  the  product 
of  terms  representing  absorption  due  to  atmospheric  gases  having  a  fixed  mixing  ratio,  absorption  due  to  ozone,  and 
absorption  due  to  water  vapor.  Coefficients  for  these  terms  are  obtained  by  fitting  transmittances  obtained  from  line- 
by-line  calculations.  Several  sets  containing  the  transmittance  line-by-line  calculations  are  precomputed  and 
included with the GLA TOVS. A detailed description of the algorithm used for transmittance calculation can be
found in (Sienkiewicz, 1996).

The calculation of the temperature Jacobian $J(T)$, or derivative of brightness temperature with respect to
temperature, is accomplished through operation on the finite-difference representation of the forward problem. The
expressions needed for the Jacobian calculation can be derived by repeated applications of the chain rule for
derivatives. Recall from (4) that $T_{b,i} = B_i^{-1}(R_i)$. Thus, the derivative with respect to temperature $T$ is given as


_  _  ^b,i  ^  _ _ 

'  ^  ~ 


^b,i 


N 

M  '  '  tyii  ^  ' 

where  ^  ~  ^-^^-1  is  the  mean  layer  temperature. 

As  we  can  see  from  (9),  the  Jacobian  plays  a  crucial  role  in  the  implementation  of  the  temperature  retrieval 
algorithms,  since  it  provides  the  sensitivity  weighting  functions,  which  relate  the  change  in  brightness  temperature  to 
the  change  in  atmospheric  temperature  profile.  As  we  will  see  in  future  Sections,  the  sensitivity  functions  will  be 
needed  to  solve  our  inversion  problem.  The  weighting  functions  for  the  HIRS/2  and  MSU  channels  used  in  our 
retrieval  problems  are  shown  in  Figure  1,  for  the  1976  US  Standard  atmosphere  model. 


3. Ill-Posedness of Temperature Retrievals

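If the atmosphere strongly absorbs, most of the contribution to the measured brightness temperature will
come from the atmosphere itself. Assuming that the surface temperature contribution $T_b(p_s,\theta)\,\tau_\nu(p_s,\theta)$ and the
term due to the reflected radiance $T_b'(\theta)$ are known, the brightness-temperature form of (3) can be written as

$$T_b - \left(T_b(p_s,\theta)\,\tau_\nu(p_s,\theta) + T_b'(\theta)\right) = \int_{p_s}^{0} \frac{\partial \tau_\nu(p,\theta)}{\partial p}\,B(T(p))\,dp$$

or as

$$\tilde{T}_b = \int_{p_s}^{0} K(\nu,p)\,B(T(p))\,dp \qquad (10)$$

where $\tilde{T}_b = T_b - \left(T_b(p_s,\theta)\,\tau_\nu(p_s,\theta) + T_b'(\theta)\right)$ and $K(\nu,p) = \dfrac{\partial \tau_\nu(p,\theta)}{\partial p}$.

Equation (10) is a Fredholm integral equation of the first kind

$$g(x) = \int_{a}^{b} K(x,y)\,f(y)\,dy$$

The inversion of the Fredholm integral in (10) is an ill-posed problem (Groetsch, 1984 and Tikhonov,
1977). The following definition is taken from Hadamard (Groetsch, 1984 and Tikhonov, 1977).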
Figure 1: Weighting functions for HIRS/2 and MSU channels. Panels: HIRS/2 Ch 1-2 and MSU Ch 4; HIRS/2 Channels 3-7; HIRS/2 Ch 10, 13, 17; HIRS/2 Ch 14, 15 and MSU Ch 1.


Definition  1:  Well-Posed  Problem:  A  problem  is  given  by  a  datum  g  and  a  solution  u,  and  it  is  Well-Posed  (in  the 
sense  of  Hadamard)  when: 

1. For each datum g in a class of functions Y there exists a solution u in a prescribed class X (existence).

2. The solution u is unique in X (uniqueness).

3. The dependence of u upon g is continuous.

An ill-posed problem is a problem that is not well-posed (i.e., it fails at least one of the conditions stated above).

This  is  not  a  precise  mathematical  definition.  To  make  it  precise  for  a  particular  problem,  the  notion  of  a 
solution  and  the  admissible  data  have  to  be  specified.  Also,  the  topology  used  for  measuring  continuity  needs  to  be 
established.  These  specifications  have  to  be  such  that  they  are  appropriate  for  the  particular  problem  (Engle,  1996). 
The finite-difference representation of (10) is stated as

$$\tilde{T}_{b,i} = \sum_{j=1}^{N} K(i,j)\,B[T_j] \qquad (11)$$

where $\tilde{T}_{b,i} = T_{b,i} - \left(T_{b,i}(p_s,\theta)\,\tau_{i,N}(\theta) + T_{b,i}'(\theta)\right)$ and $K(i,j) = \tau_{i,j-1}(\theta) - \tau_{i,j}(\theta)$. This leads to the nonlinear model

$$T_b = f(T) + e$$

where $T$ is the temperature profile to be retrieved, $f: \mathbb{R}^n \to \mathbb{R}^m$ is a nonlinear vector function that maps the desired
parameters to the measurement space, and $e$ is an error term associated with measurement noise and the truncation
error arising from the discretization of the integral in (10). This problem also suffers from inherited
ill-conditioning and is underdetermined since in most retrieval problems of interest $m < n$.

4.  Regularization  of  the  Discrete  Problem 

The retrieval problem introduced in (11) has two major difficulties associated with it:

• ill-conditioning due to the ill-posedness of inverting the associated integral equation (10), and

• multiple solutions because of trying to estimate temperature profiles with more atmospheric levels, 29,
than measurements available, 22.

To overcome these difficulties, we can use the so-called regularization methods. Regularization theory (Tikhonov,
1977) transforms an ill-posed problem into a well-posed one, using a priori knowledge on the nature of the solution.
It is necessary to incorporate further information about the desired solution in order to stabilize the problem and to
single out a useful and stable solution (Hansen, 1998). This is the purpose of regularization.

There are various methods to achieve regularization of ill-posed problems. In our algorithm, we first
linearize the nonlinear retrieval problem and then regularize the resulting linear problem. Next, we will present the
two methods we used to regularize the linearized problem. These are the Tikhonov method and the Truncated
Singular Value Decomposition (TSVD) method.
4.1 Tikhonov regularization

The most common and well-known form of regularization is the one known as Tikhonov regularization
(Tikhonov, 1977). To explain the method, consider the underdetermined linear least-squares problem

$$\min_{x} \|Ax - b\|_2, \qquad m < n. \qquad (12)$$

One way to regularize (12) is by computing the estimate, $\hat{x}$, as the solution to the optimization problem

$$\hat{x} = \arg\min_{x} \left\{ \|Ax - b\|_2^2 + \lambda_{TIKH}^2\,\|L(x - x_0)\|_2^2 \right\} \qquad (13)$$

or equivalently

$$\hat{x} = \arg\min_{x} \left\| \begin{pmatrix} A \\ \lambda_{TIKH} L \end{pmatrix} x - \begin{pmatrix} b \\ \lambda_{TIKH} L x_0 \end{pmatrix} \right\|_2$$

where $\|\cdot\|_2$ is the $\ell_2$ norm,¹ $x_0$ is an initial estimate of the solution, and $\lambda_{TIKH}$ is the regularization parameter
for the Tikhonov method. The matrix $L$ is typically either the identity matrix $I_n$, as in our case, or
an approximation to some derivative operator (Hansen, 1992).

A key issue in this method is the selection of the regularization parameter $\lambda_{TIKH}$. This parameter controls
the weight given to minimization of the side constraint, $\|x - x_0\|_2$, relative to minimization of the residual norm,
$\|Ax - b\|_2$. A large $\lambda_{TIKH}$ (i.e., a large amount of regularization) favors a solution close to the initial guess at the
cost of a large residual norm. Conversely, a small $\lambda_{TIKH}$ (i.e., a small amount of regularization) favors a small
residual norm at the cost of a large side constraint. An optimal value of $\lambda_{TIKH}$ balances the two terms. This is further
discussed in (Hansen, 1998).
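The trade-off controlled by the regularization parameter can be seen on a very small underdetermined system. The sketch below assumes $L = I$ and solves the normal equations $(A^T A + \lambda^2 I)\,x = A^T b + \lambda^2 x_0$ of (13) by Gaussian elimination. This is an illustrative shortcut only: the helper names are hypothetical, and the approach is not the implementation used in the MATLAB Regularization Toolbox or in this work.

```python
def solve_linear(matrix, rhs):
    """Solve a small square system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [list(row) + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def tikhonov(a, b, x0, lam):
    """Minimize ||A x - b||^2 + lam^2 ||x - x0||^2 (the case L = I)."""
    m, n = len(a), len(a[0])
    # Normal equations: (A^T A + lam^2 I) x = A^T b + lam^2 x0.
    lhs = [
        [sum(a[k][i] * a[k][j] for k in range(m)) + (lam**2 if i == j else 0.0)
         for j in range(n)]
        for i in range(n)
    ]
    rhs = [sum(a[k][i] * b[k] for k in range(m)) + lam**2 * x0[i] for i in range(n)]
    return solve_linear(lhs, rhs)


# One measurement, two unknowns: x1 + x2 = 2.
a = [[1.0, 1.0]]
b = [2.0]
x_moderate = tikhonov(a, b, [0.0, 0.0], 1.0)   # compromise between fit and prior
x_heavy = tikhonov(a, b, [1.0, 0.0], 1.0e3)    # stays close to the initial guess
```

With the moderate parameter both unknowns equal 2/3, so the single measurement is only partly fitted; with the heavy parameter the estimate barely moves away from the initial guess, which is the behavior described above for large regularization.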

In our algorithm, the selection of the Tikhonov regularization parameter $\lambda_{TIKH}$ was based on the L-curve
method. The L-curve is a log-log plot, for all valid regularization parameters, of $\|x - x_0\|_2$, which indicates
the discrepancy from the initial guess, versus the corresponding prediction error $\|Ax - b\|_2$. The curve displays
the compromise between minimization of these two quantities (Hansen, 1998). Figure 2, adapted from (Hansen,
1992), shows the generic form of the L-curve. The horizontal part of the curve corresponds to solutions where the
regularization error dominates, i.e., where so much regularization is applied that the solution stays very
smooth. For this case $\|x - x_0\|_2$ changes very little with $\lambda_{TIKH}$. The vertical part of the curve corresponds to
solutions that are dominated by the prediction error, and the residual norm $\|Ax - b\|_2$ does not change much. In general,
the optimal regularization parameter is very close to the L-curve's corner. Thus, by locating the corner of the curve,
an approximation to the optimal regularization parameter can be computed (Hansen, 1996). The MATLAB
Regularization Toolbox (Hansen, 1992) has a routine to compute this parameter.

¹ The norm is given by $\|x\|_2 = \sqrt{x^T x}$.


Figure  2:  Generic  form  of  the  L-curve. 


4.2 Truncated Singular Value Decomposition

Before  we  describe  the  Truncated  Singular  Value  Decomposition  (TSVD)  regularization  method,  it  is 
convenient  to  introduce  the  definition  of  singular  value  decomposition  (SVD).  The  SVD  is  one  of  the  best  numerical 
tools  for  analysis  of  discrete  ill-posed  least-squares  problems  like  (12),  since  it  reveals  all  the  difficulties  associated 
with the ill-conditioning and rank deficiency of the matrix A. The use of SVD is the most reliable way to determine
the  rank  and  nearness  to  rank-deficiency  of  A  (Datta,  1995).  The  following  definition  of  the  SVD  has  been  modified 
to  fit  our  problem. 

Definition 4.2 Singular Value Decomposition: Let $A \in \mathbb{R}^{m \times n}$ be a rectangular matrix with $m < n$. Then the
SVD of $A$ is a decomposition of the form

$$A = V \Sigma U^T = \sum_{i=1}^{m} v_i\,\sigma_i\,u_i^T \qquad (14)$$

where $U = (u_1, \ldots, u_m)$ and $V = (v_1, \ldots, v_m)$ are matrices with orthonormal columns, $U^T U = V^T V = I_m$,
and where $\Sigma = \mathrm{diag}(\sigma_1, \ldots, \sigma_m)$ has non-negative diagonal elements appearing in non-increasing order such
that

$$\sigma_1 \geq \cdots \geq \sigma_m \geq 0$$

The numbers $\sigma_i$ are the singular values of $A$, while the vectors $v_i$ and $u_i$ are the left and right singular vectors of $A$,
respectively.

In terms of the SVD of $A$, the minimum norm least-squares solution of (12) is given as

$$x = \sum_{i=1}^{m} \frac{v_i^T b}{\sigma_i}\,u_i \qquad (15)$$

The approach of TSVD is to derive a new problem with a well-conditioned rank-deficient coefficient matrix. Using
the SVD of $A$, we obtain the closest rank-$k$ approximation $A_k$ to $A$ (measured in the 2-norm) by truncating the SVD
expansion in (14) at $k$. Thus, $A_k$ is given by

$$A_k = \sum_{i=1}^{k} v_i\,\sigma_i\,u_i^T$$

where $k$ is the TSVD regularization parameter $\lambda_{TSVD}$. When $A$ is replaced by $A_k$, we obtain the least-squares
problem $\min_x \|A_k x - b\|_2$. The truncated SVD solution (i.e., the minimum-norm solution to
this problem) is given by

$$x_k = \sum_{i=1}^{k} \frac{v_i^T b}{\sigma_i}\,u_i \qquad (16)$$

and the complete method is referred to as the truncated SVD (TSVD) (Hansen, 1996). For all cases where $k < m$,
(16) will be a truncated version of the minimum norm least-squares solution in (15).
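The effect of truncation can be seen on a matrix whose SVD is known by construction. The sketch below assumes the singular triplets are already available (no SVD routine is implemented) and follows the convention of (14), in which the vectors $v_i$ pair with the right-hand side $b$ and the vectors $u_i$ span the solution. The function name and the test matrix are hypothetical.

```python
def tsvd_solution(v_cols, sigma, u_cols, b, k):
    """Truncated SVD solution: sum over the first k terms of
    (v_i^T b / sigma_i) * u_i, for A = V Sigma U^T."""
    n = len(u_cols[0])
    x = [0.0] * n
    for v, s, u in list(zip(v_cols, sigma, u_cols))[:k]:
        coeff = sum(vi * bi for vi, bi in zip(v, b)) / s
        for j in range(n):
            x[j] += coeff * u[j]
    return x


# A = [[3, 0, 0], [0, 1e-6, 0]] has the SVD below by inspection.
v_cols = [[1.0, 0.0], [0.0, 1.0]]            # vectors in measurement space
u_cols = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]  # vectors in solution space
sigma = [3.0, 1.0e-6]

# The second component of b carries a small error of size 1e-3.
b = [3.0, 1.0e-3]

x_full = tsvd_solution(v_cols, sigma, u_cols, b, 2)  # minimum-norm solution (15)
x_trunc = tsvd_solution(v_cols, sigma, u_cols, b, 1)  # truncated solution (16), k = 1
```

In the full solution the tiny singular value amplifies the error in $b$ to a component of size about $10^3$, whereas the truncated solution discards that term and keeps only the well-determined component.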


5.  The  Discrete  Picard  Condition  and  Filter  Factors 

In  this  section,  we  introduce  the  discrete  Picard  condition,  and  the  concept  of  filter  factors.  The  latter  play 
an  important  role  in  connection  with  regularization  of  discrete  ill-posed  problems  (Hansen,  1996). 

Let us refer to the Fredholm integral in (10) and its finite-difference form (11), both repeated here for
convenience:

$$\tilde{T}_b = \int_{p_s}^{0} K(\nu,p)\,B[T(p)]\,dp$$

and

$$\tilde{T}_{b,i} = \sum_{j=1}^{N} K(i,j)\,B[T_j]$$

We can express the kernel $K$, of size 22 × 29 and rank 22, by its SVD expansion as

$$K = \sum_{i=1}^{22} \sigma_i\,v_i\,u_i^T$$
i=l 
It is reasonable to think that the behavior of the singular values $\sigma_i$ and the singular functions $u_i$ and $v_i$ is strongly
connected with the properties of $K$. It is useful to note the following (Hansen, 1996):

• The smoother the kernel, the faster the singular values decay to zero (where the number of continuous partial
derivatives measures smoothness of the kernel).

• The smaller the singular values, the more oscillations (or zero-crossings) in the singular functions.

The decay rate of the singular values is so fundamental to the behavior of ill-posed problems that some have
proposed it as a tool to characterize the degree of ill-posedness. The mathematical formulation used to guarantee
that there exists a square integrable solution $T(p)$ with finite $L_2$ norm in (10), based on an existing SVD of the kernel
$K$, is given by the Picard condition (Hansen, 1996).

For discrete ill-posed problems, there is also a discrete Picard condition. Consider the discrete ill-posed
problem (12). Let $b = \bar{b} + e$, where $e$ is an error term due to measurement errors, approximation errors, and
rounding errors, and $\bar{b}$ is the unperturbed right-hand side. The discrete Picard condition provides the mathematical
formulation needed to assure that we compute a regularized solution $x_\lambda$ from the given right-hand side $b$ such that $x_\lambda$
approximates the exact solution $x$ (Hansen, 1992).

Definition 4.3 Discrete Picard Condition: The unperturbed right-hand side $\bar{b}$ in a discrete ill-posed problem with
regularization matrix $L$ satisfies the Discrete Picard Condition if the Fourier coefficients $|v_i^T \bar{b}|$, on the average,
decay to zero faster than the singular values $\sigma_i$. It can be shown that if (10) satisfies the Picard condition for the
continuous case, then the discrete ill-posed problem obtained by discretization of the integral equation satisfies the
discrete Picard condition (Hansen, 1992).


Now consider again the standard solution given in (15). If the Fourier coefficients $|v_i^T b|$ in the numerator
do not decay as fast as the singular values, the solution $x$ will be dominated by terms in the sum corresponding to the
smallest singular values $\sigma_i$ in the denominator. Thus, the solution $x$ will have many sign changes and will appear
completely random. Regularization methods attempt to filter out these effects, or at least reduce them to some degree.
Thus, in terms of the SVD, our desired regularized solution can be stated as

$$x_\lambda = \sum_{i=1}^{m} f_i\,\frac{v_i^T b}{\sigma_i}\,u_i$$

where $f_i$ are called the filter factors (Hansen, 1998). They have the property that as $\sigma_i$ decreases, the corresponding $f_i$
tend to zero such that the contribution of the smallest $\sigma_i$ to the solution is filtered out.
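The damping role of the filter factors can be illustrated numerically. The sketch below assumes the standard Tikhonov form $f_i = \sigma_i^2 / (\sigma_i^2 + \lambda^2)$ for $L = I$ and the TSVD form in which $f_i$ is one for the first $k$ terms and zero afterwards; the function names and sample values are hypothetical.

```python
def tikhonov_filter_factors(sigma, lam):
    """Tikhonov filter factors for L = I: sigma_i^2 / (sigma_i^2 + lam^2)."""
    return [s**2 / (s**2 + lam**2) for s in sigma]


def tsvd_filter_factors(sigma, k):
    """TSVD filter factors: 1 for the first k singular values, 0 otherwise."""
    return [1.0 if i < k else 0.0 for i in range(len(sigma))]


def filtered_coefficients(sigma, fourier, factors):
    """Expansion coefficients f_i * (v_i^T b) / sigma_i of the filtered solution."""
    return [f * c / s for s, c, f in zip(sigma, fourier, factors)]


sigma = [1.0, 0.1, 1.0e-4]     # decaying singular values
fourier = [1.0, 0.05, 1.0e-3]  # coefficients v_i^T b; the last is mostly noise

f_tikh = tikhonov_filter_factors(sigma, 0.1)
f_tsvd = tsvd_filter_factors(sigma, 2)

unfiltered = filtered_coefficients(sigma, fourier, [1.0, 1.0, 1.0])
damped = filtered_coefficients(sigma, fourier, f_tikh)
```

Without filtering, the last coefficient is amplified to about 10 and dominates the solution; with the Tikhonov factors it is reduced to roughly $10^{-5}$, while the first coefficient is almost unchanged.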

There  are  various  ways  of  computing  the  filter  factors.  In  fact,  different  regularization  methods  can  be 
defined  as  different  ways  to  determine  filter  factors  (Hansen,  1996).  For  Tikhonov  regularization  the  filter  factors 
are  given  by 
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r.2 

^TIKH 

for  i  ^  ^tsvd 
otherwise 


6. Gauss-Newton Retrieval Algorithm

In this section, we explain the retrieval algorithms developed in this research based on the Gauss-Newton
algorithm. We derive the Gauss-Newton formulation for our problem of interest, and explain how
regularization is incorporated into the algorithm. A description of the developed set of Matlab® functions is included
along with diagrams that show how these functions interact. Our approach is mostly based on a regularization
method developed by Eriksson (1996). However, some modifications were incorporated to make it work for our
application. Finally, we provide a brief introduction to the GLA TOVS code for forward radiance and Jacobian
computation (Sienkiewicz, 1996).

6.1 Gauss-Newton Algorithm

The Gauss-Newton method for nonlinear least-squares problems can be summarized as follows.

Algorithm 1: GN algorithm for nonlinear least squares

Given an initial guess $\hat{T}^{(0)}$, set a = 0. Iterate until convergence is achieved:

1. Update the estimate according to

$$\hat{T}^{(a+1)} = \hat{T}^{(a)} + \alpha^{(a)} \delta^{(a)}$$

where the search direction $\delta^{(a)}$ is computed by solving the least-squares problem

$$\delta^{(a)} = \arg\min_{\delta} \| J(\hat{T}^{(a)}) \delta + r^{(a)} \|_2^2 \qquad (17)$$

$\alpha^{(a)}$ is selected so that the cost function is sufficiently decreased, $\hat{T}^{(a)}$ is the estimate at iteration
a, and J(T) is the Jacobian of f evaluated at T.

2. If the convergence criterion is satisfied, then stop; else, set a = a+1 and go back to 1.

Notice that the computation of the search direction in (17) is based on the linearization of the original nonlinear
problem. The linear least-squares problem (17) is an underdetermined problem, and it is here that regularization
techniques for linear discrete problems are applied. This approach is based on (Eriksson, 1996).
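The Gauss-Newton iteration can be sketched on a toy problem. The Python example below is only an illustration and not the retrieval code: it fits a single parameter p in the hypothetical model $y_i = \exp(p\,t_i)$, takes the residual as model minus data (an assumed sign convention, chosen so that the step is added to the estimate), and uses plain step halving in place of a formal step-length rule. All names are illustrative.

```python
import math


def gauss_newton_1d(t, y, p0, tol=1e-10, max_iter=50):
    """Fit y_i = exp(p * t_i) for a scalar p with a damped Gauss-Newton iteration."""

    def residual(p):
        # Residual taken as model minus data (assumed convention).
        return [math.exp(p * ti) - yi for ti, yi in zip(t, y)]

    def cost(p):
        return 0.5 * sum(ri * ri for ri in residual(p))

    p = p0
    for _ in range(max_iter):
        r = residual(p)
        jac = [ti * math.exp(p * ti) for ti in t]  # d r_i / d p
        # Search direction: minimizer of sum((J_i * delta + r_i)^2) over delta.
        delta = -sum(ji * ri for ji, ri in zip(jac, r)) / sum(ji * ji for ji in jac)
        # Step length: halve until the cost does not increase.
        alpha = 1.0
        while cost(p + alpha * delta) > cost(p) and alpha > 1e-12:
            alpha *= 0.5
        p_new = p + alpha * delta
        if abs(p_new - p) <= tol * (1.0 + abs(p_new)):
            return p_new
        p = p_new
    return p


# Noise-free synthetic data generated with p = 0.7.
t_data = [0.5, 1.0, 1.5, 2.0]
y_data = [math.exp(0.7 * ti) for ti in t_data]
p_hat = gauss_newton_1d(t_data, y_data, p0=0.0)
```

In the retrieval problem the scalar division is replaced by a regularized solution of the linear least-squares problem, since the Jacobian there is ill-conditioned.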


6.2  Retrieval  Using  Gauss-Newton  Algorithm  with  Tikhonov  Regularization 

As discussed in Section 4, the linear least-squares problem in (17) is an underdetermined problem.
Therefore, the regularization algorithms derived there are used to regularize the linear problem using different techniques.
A regularized version of Algorithm 1, including Tikhonov regularization, can be summarized as follows.

Algorithm 2: Regularization algorithm based on GN and Tikhonov regularization

Given an initial temperature profile $\hat{T}^{(0)}$, a center c, an initial regularization parameter $\lambda_{TIKH}^{(0)}$, and with a = 0,
iterate until convergence is achieved:

1. Compute $r^{(a)} = T_b - f(\hat{T}^{(a)})$ and the SVD of $J(\hat{T}^{(a)})$.

2. Do a convergence test. If the following criterion is fulfilled, then stop the iterations: for
$h^{(a)} = \| J^{(a)} \delta^{(a)} + r^{(a)} \|_2$,

$$\frac{|h^{(a)} - h^{(a-1)}|}{1 + |h^{(a)}|} < \epsilon$$

3. Compute the Tikhonov regularization parameter using the L-curve method (Hansen, 1998).

4.  If 

,T  ,  ,  /  ,  ^  \2, 


-  ^tikhY 


(18) 


and  ^  ^IKH »  1^^^  ^IKH  ^  ^IKH  ^TIKH  ""  ^IKH  ■  7 

5. Compute the search direction $\delta_{TIKH}^{(a)}$ by solving the Tikhonov regularized minimization problem

$$\delta_{TIKH}^{(a)} = \arg\min_{\delta} \left\{ \| J^{(a)} \delta + r^{(a)} \|_2^2 + \lambda_{TIKH}^2 \| \hat{T}^{(a)} + \delta - c \|_2^2 \right\}$$

using the SVD of $J^{(a)}$, where $J^{(a)} = J(\hat{T}^{(a)})$, and
$\hat{T}^{(a)}$ is the estimate of T at iteration a.

6. Determine the step-length $\alpha^{(a)}$ so that the objective function is sufficiently reduced, using the Armijo
rule (Bertsekas, 1995).

7. Update the estimate according to

$$\hat{T}^{(a+1)} = \hat{T}^{(a)} + \alpha^{(a)} \delta_{TIKH}^{(a)}$$

8. Set a = a+1 and go back to step 1.

Initialization plays an important role in optimization algorithms. In general, the final estimate is highly
dependent on the initial profile. The selection of the initial guess was primarily based on the location of the
tropopause height. This allows us to study whether the algorithm is capable of determining the tropopause height
when fed with initial guesses that have the tropopause height far from the actual height. The capability of
determining this peak is a very useful feature of the retrieval algorithm since most important weather features are
located in this region of the atmosphere (Trenberth, 1992).

Another parameter that must be chosen carefully is the center c. We want to make c as close as possible to
the initial temperature profile $\hat{T}^{(0)}$. In our problem, we set $c = \hat{T}^{(0)} - 0.25$. The brightness temperatures $T_b$ and
the Jacobian $J(\hat{T})$ were computed using the GLA TOVS rapid algorithm.
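A short arithmetic check on this choice of center: with a constant offset of 0.25 at every level, the distance between the initial profile and c depends only on the number of levels. The Python sketch below assumes a profile with 29 levels. That number is not stated in this section; it is an inference, being the value that reproduces the initial entry 1.3463e+00 listed in the output tables of Section 7. The function name is illustrative.

```python
import math


def center_offset_norm(n_levels, offset=0.25):
    """2-norm of T0 - c when c = T0 - offset at each of n_levels levels."""
    return math.sqrt(n_levels) * offset


# Assumed level count (29), inferred from the tabulated value, not from the text.
norm_29 = center_offset_norm(29)
```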


The criterion in step 4 of the algorithm is essential to achieve convergence (Eriksson, 1996). To assure
convergence, a condition on the gradient, stated in (18), must hold before we
change to a smaller $\lambda_{TIKH}$ value. We also require a decreasing sequence of $\lambda_{TIKH}$ values. Proof of the validity of
the criterion in (18) is provided on page 89 of (Eriksson, 1996).

Finally, the step-length computation in step 6 of the algorithm is done using the Armijo rule (Bertsekas, 1995).
This rule is essentially a successive reduction rule. An initial step-length $\alpha^{(a)}$ is chosen, and if the corresponding
vector $\hat{T}^{(a)} + \alpha^{(a)} \delta^{(a)}$ does not yield a reduction of the residual norm function
$g(\hat{T}^{(a)})$, i.e., $g(\hat{T}^{(a)} + \alpha^{(a)} \delta^{(a)}) \ge g(\hat{T}^{(a)})$, the step-length is reduced by a certain
factor until the value of the residual norm is improved. The Armijo rule reduces the possibility of convergence to a
local minimum by adding the following constraint to the reduction of $\alpha^{(a)}$ (Bertsekas, 1995),

$$g(\hat{T}^{(a)}) - g(\hat{T}^{(a)} + \beta^m s \, \delta^{(a)}) \ge -\sigma \beta^m s \, \nabla g(\hat{T}^{(a)})^T \delta^{(a)}. \qquad (5.22)$$

The step-lengths $\beta^m s$, m = 0, 1, ..., are tried successively until the above inequality is satisfied for $m = m_a$.
Usually, $\sigma \in [10^{-5}, 10^{-1}]$, $\beta \in [1/10, 1/2]$, and s = 1.
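The successive reduction in the Armijo rule can be written compactly. The following Python sketch is a generic illustration under stated assumptions, not the MATLAB `update` routine: the objective g and its gradient are passed in by the caller, vectors are plain lists, and the argument names (`s`, `beta`, `sigma`, `max_reductions`) are illustrative.

```python
def armijo_step(g, grad, x, d, s=1.0, beta=0.5, sigma=1e-4, max_reductions=50):
    """Return the first step beta**m * s (m = 0, 1, ...) satisfying the Armijo inequality.

    g     -- objective function, maps a list of floats to a float
    grad  -- gradient of g at x (list of floats)
    x     -- current point (list of floats)
    d     -- search direction (list of floats), assumed to be a descent direction
    """
    g0 = g(x)
    slope = sum(gi * di for gi, di in zip(grad, d))  # grad(x)^T d, negative for descent
    step = s
    for _ in range(max_reductions):
        trial = [xi + step * di for xi, di in zip(x, d)]
        # Armijo test: the actual decrease must be at least a fraction sigma of
        # the decrease predicted by the linear model.
        if g0 - g(trial) >= -sigma * step * slope:
            return step
        step *= beta
    return step  # last step tried if the test never passed


# One-dimensional example: g(x) = x^2 at x = 1 with an overly long direction.
step = armijo_step(lambda v: v[0] ** 2, grad=[2.0], x=[1.0], d=[-2.0], sigma=0.1)
```

In this example the full step overshoots the minimum and is rejected, and the first reduced step is accepted.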


A  very  important  part  of  this  iterative  process  is  the  computation  of  new  brightness  temperatures  and  a  new 
Jacobian  for  each  iteration.  The  new  estimated  temperature  profile  is  used  as  new  input  to  the  radiative  transfer 
code,  to  compute  the  corresponding  outputs. 


6.2.1  Retrieval  Using  Gauss-Newton  Algorithm  with  TSVD  Regularization 

This algorithm differs from Algorithm 2 in the way that steps 3 through 5 are computed. The complete
algorithm is outlined next.

Algorithm 3: Regularization algorithm based on GN and TSVD regularization

Given an initial temperature profile $\hat{T}^{(0)}$, a center c, and with a = 0, iterate until convergence is achieved:

1. Compute $r^{(a)} = T_b - f(\hat{T}^{(a)})$ and the SVD of $J(\hat{T}^{(a)})$.

2. Do a convergence test. If the following criterion is fulfilled, then stop the iterations: for
$h^{(a)} = \| J^{(a)} \delta^{(a)} + r^{(a)} \|_2$,

$$\frac{|h^{(a)} - h^{(a-1)}|}{1 + |h^{(a)}|} < \epsilon$$

3. Compute the regularization parameter $\lambda_{TSVD}^{(a)}$ using the following criterion:

a. Let $\lambda_{TSVD}^{(0)} = 3$, $\lambda_{TSVD}^{(1)} = 7$.

b. For a > 1: If $\lambda_{TSVD}^{(a)} = \lambda_{TSVD,MAX}$ then $\lambda_{TSVD}^{(a+1)} = \lambda_{TSVD}^{(a)}$.


C.  Let  ’  = 


(“) 

w'  '  = 


“)] 


„(“)  _  J“-l) 


K")  _fH) 


.  If  ’  -5'’  ''  >y  .  and  ,  and  1^°^  >  CO  , 


t  1 

then  ^^SVD  —  ^^SVD  ^ 

t  0(^+1)  1  o 

d.  If  ^  CO  y  then  ^q'SVD  ““  ^SVD  ^ 

e.  If  or  <  y  ,  then  “  ^SVD  ^ 
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r*  2^+^)  -  ;h)  + 1 

I-  ^SVD  ~  ^  ^  • 

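4. Compute the search direction $\delta_{TSVD}^{(a)}$ as the TSVD regularized solution of (17), using the SVD of $J^{(a)}$
and the truncation parameter $\lambda_{TSVD}^{(a)}$.

5. Determine the step-length $\alpha^{(a)}$ so that the objective function is sufficiently reduced, using the Armijo
rule (Bertsekas, 1995).

6. Update the estimate according to

$$\hat{T}^{(a+1)} = \hat{T}^{(a)} + \alpha^{(a)} \delta_{TSVD}^{(a)}$$

7. Set a = a+1 and go back to step 1.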


From the definition of TSVD we know that $\lambda_{TSVD}$ is a positive integer and that $1 \le \lambda_{TSVD} \le m$, where
m is equal to the number of singular values of the SVD of J. Thus, if any computed $\lambda_{TSVD}^{(a+1)}$ is greater than the
maximum permissible value m, it will be assigned the maximum value. Also, we used $\gamma = 1000$ and $\omega = 4$. Much care
must be taken when computing the regularized solution for the TSVD subproblem (Eriksson, 1996). The derivation
of a regularization parameter update criterion that assures global convergence of the solution is not a simple task.
The criterion outlined in step 3 of Algorithm 3 was defined by carefully studying the nature of our problem. The
main characteristic of this criterion is that the regularization parameter $\lambda_{TSVD}$ always increases (reducing the
applied regularization). We know from the theory that enough regularization must be applied at the beginning of the
iterations, and then it should be gradually decreased. But it is very difficult to know how fast or how slow this
should be done, or how much regularization must be applied at the beginning. Allowing the regularization
parameter to decrease too fast will result in convergence towards a local minimum with a large residual norm
(Eriksson, 1996).
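The bound on the truncation parameter can be enforced in one line. The Python sketch below is an illustration only: it takes the increment as an input, because the exact update amounts prescribed by step 3 of the algorithm are not reproduced here, and the function name is hypothetical.

```python
def next_truncation_index(current, increment, m):
    """Raise the TSVD truncation index by `increment`, keeping it within 1..m.

    m is the number of singular values of the Jacobian.
    """
    if increment < 0:
        raise ValueError("the truncation index is not allowed to decrease")
    return max(1, min(current + increment, m))
```

For example, with a hypothetical m = 22, a requested update from 20 by 4 is capped at 22, and once the maximum is reached the index stays there.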


6.3 GLA TOVS Forward Radiance and Jacobian Computations

In order to implement the retrieval algorithm, the forward radiative transfer calculation was needed. This
provides us with the brightness temperatures $T_b = f(\hat{T})$ as a function of the estimated profile, as well as
the Jacobian $J(\hat{T})$. This computation is performed at least once on each iteration.

We solved the RTE in (3) using the Goddard Laboratory for Atmospheres (GLA) TOVS Rapid Algorithm
for Forward Radiance and Jacobian Computations Version 1.0. This is a Fortran code developed at the Data
Assimilation Office of the Goddard Laboratory for Atmospheres, and the General Sciences Corporation in Laurel,
Maryland (Sienkiewicz, 1996). The code allows the computation of forward radiance, brightness temperature, and
brightness temperature Jacobian for simulations of the TOVS HIRS/2 and MSU radiometers. A detailed description
can be found in (Sienkiewicz, 1996).

6.4  Description  of  the  MATLAB®  Code 

The  developed  algorithms  were  implemented  as  MATLAB®  functions.  We  divided  the  code  into  eight  basic 
functions  that  allowed  the  debugging  and  documentation  to  be  easier  and  clearer.  The  main  functions  used  in  the 
implementation  of  the  algorithm  are  described  in  Table  1.  Table  2  describes  auxiliary  routines  to  the  main  functions. 
The  diagram  in  Figure  3  shows  the  function  interaction,  along  with  the  most  important  input  and  output  parameters 
for  the  main  functions.  The  square  box  contains  the  program's  output  parameters. 


Table 1: Main functions for the GN-based regularization algorithm.

Function    Description

gn_regu     Main program. Other functions are called from this routine. When execution is
            completed this function returns output parameters of interest.
Jfinit      Computes initial values for brightness temperature and Jacobian matrix. Also
            returns the initial guess for the temperature profile.
windraw     Opens a window and generates plots of important quantities. A call is made to
            function picard *.
tikhol      Computes the regularization parameter for the Tikhonov subproblem. A
            selection criterion is applied to assure global convergence. A call is made to
            function l_curve *.
tikho       Computes the Tikhonov regularized solution of the search direction.
tsvdl       Computes the regularization parameter for the TSVD subproblem. A selection
            criterion is applied to assure global convergence. A call is made to function
            l_curve *.
tsvdreg     Returns the TSVD regularized solution of the search direction. A call is made to
            function tsvd *.
update      Computes the updated temperature profile for each iteration. Implements the
            Armijo criterion for the computation of the step-length.

* Functions from P.C. Hansen's Regularization Tools


Table 2: Auxiliary functions for the GN-based regularization algorithm.

Function     Description

inpar        Generates needed atmospheric profiles used to run the GLA TOVS simulation.
atm_prof     Data file loaded by function inpar to read part of the needed atmospheric
             profiles.
profiles     Generates a vector with the chosen initial guess of the temperature profile.
picard *     Facilitates the visual inspection of the Picard condition.
l_curve *    Plots the L-curve and finds its corner.
l_corner *   Auxiliary function to function l_curve.
lcfun *      Auxiliary function to l_corner.
tsvd *       Computes the TSVD regularized solution of the search direction.

Figure 3: Function interaction for the GN-based regularization algorithm.


7.  Simulation  Results 

In  this  section,  we  present  and  discuss  the  simulation  results  for  the  temperature  retrievals.  The  section  is 
divided  into  four  main  sections.  In  Section  7.1,  we  present  the  results  of  retrievals  using  Tikhonov  regularization. 
The  results  for  retrievals  using  TSVD  regularization  are  included  in  Section  7.2.  Similarly,  in  Section  7.3,  we  show 
some  results  of  retrievals  in  the  presence  of  noise.  Finally,  in  Section  7.4,  we  establish  a  comparison  between  the 
two  regularization  methods  for  both  noiseless  and  noisy  cases,  and  discuss  the  main  differences  of  the  generated 
results. 

7.1  Retrieval  Using  Tikhonov  Regularization 

Algorithm 2 was used to solve five different retrieval problems, each with a different initial temperature
profile $\hat{T}^{(0)}$. The first retrieval corresponds to a tropopause initial guess at 5 km. The real temperature profile used in all
the retrievals is shown in Figure 4. All cases are noiseless.

Some important output parameters for the 5 km simulation are included in Table 3. The first column of the
table contains the regularization parameter computed for each iteration a. The linearization residual norm for the
problem is included in column number two and shown in Figure 5. The residual norm of the nonlinear
least-squares problem is included in column number four of Table 3.


Table 3: Output parameters for retrieval using Tikhonov regularization with 5 km tropopause initial guess.

Columns: iteration a; $\lambda_{TIKH}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda       h            residual     gradient     ||T-c||      rel. h       rel. T
 0   1.0000e-02   1.5128e+02   7.5648e+01   1.1128e+02   1.3463e+00   -            -
 1   1.0000e-02   7.5916e+01   2.0702e+00   8.1140e-01   8.5400e+01   9.7989e-01   6.7164e-02
 2   7.9937e-05   1.4695e+00   8.4000e-01   2.3128e-01   1.0899e+02   3.0146e+01   5.4517e-02
 3   7.9937e-05   7.7154e-01   7.5103e-02   1.6469e-02   1.0121e+02   3.9401e-01   1.9640e-02
 4   7.9937e-05   5.8276e-02   1.7215e-02   3.3725e-03   1.0095e+02   6.7399e-01   7.4457e-03
 5   9.0892e-07   2.7655e-02   4.2862e-02   1.0409e-02   1.0313e+02   2.9797e-02   1.5847e-02
 6   9.0892e-07   2.3661e-02   3.7977e-02   7.9431e-03   1.0336e+02   3.9009e-03   1.1890e-02
 7   9.0892e-07   1.0118e-02   2.8880e-02   5.7166e-03   1.0367e+02   1.3407e-02   1.2258e-02
 8   9.0892e-07   8.1872e-03   3.3534e-02   6.3896e-03   1.0351e+02   1.9156e-03   1.3228e-02
 9   9.0892e-07   1.9293e-02   1.4529e-02   3.0001e-03   1.0249e+02   1.0896e-02   9.4883e-03
10   9.0892e-07   2.5358e-02   2.2100e-02   4.3831e-03   1.0320e+02   5.9146e-03   1.0161e-02
11   9.0892e-07   9.4799e-03   3.1526e-02   6.0202e-03   1.0332e+02   1.5729e-02   1.3742e-02
12   9.0892e-07   3.0102e-02   6.1365e-02   1.2061e-02   1.0369e+02   2.0020e-02   1.5602e-02
13   9.0892e-07   1.5682e-02   5.8453e-02   1.3620e-02   1.0362e+02   1.4197e-02   1.0736e-02
14   9.0892e-07   3.7900e-02   2.0979e-02   4.4723e-03   1.0286e+02   2.1406e-02   8.3572e-03

Figure 4: Real temperature profile used for the retrievals. (Horizontal axis: Degrees (K).)

Figure 5: Residual norm for Tikhonov regularization with 5 km tropopause initial guess. (Plot title: "Residual
Norm for 14 iterations"; horizontal axis: Iterations.)

We expect this quantity to decrease towards zero as the estimated temperature profile approaches the real
profile. The gradient of this quantity is contained in the next column. Column number six provides a quantitative
measure of how the updated temperature estimate deviates from the center c. Since the center was assigned very
close to the temperature initial guess, this is the only parameter that we expect to increase with the number of
iterations. Finally, the last two columns of the table contain the quantities used in the algorithm stopping rule. The
convergence criterion was satisfied when the following condition prevailed:

$$\frac{|h^{(a)} - h^{(a-1)}|}{1 + |h^{(a)}|} < 10^{-3}, \qquad (19)$$

where $h^{(a)} = \| J^{(a)} \delta^{(a)} + r^{(a)} \|_2$.

For this tropopause initial guess, the solution began diverging before (19) was satisfied. Figure 6 shows the
estimated profile for fourteen iterations. Here, the dotted line is the real profile, the dashed line is the initial guess,
and the solid line represents the estimated profile. The best approximation possible was achieved after four
iterations, as shown in Figure 7.
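The stopping rule can be checked directly against the tabulated values. The Python sketch below is illustrative and not the original MATLAB code: it assumes a threshold of $10^{-3}$ and applies the relative-change test to the $h^{(a)}$ columns of Table 3 (5 km case) and Table 4 (10 km case). Function and variable names are hypothetical.

```python
def relative_change(h_now, h_prev):
    """|h_now - h_prev| / (1 + |h_now|), the quantity tested in the stopping rule."""
    return abs(h_now - h_prev) / (1.0 + abs(h_now))


def first_converged_iteration(h, eps=1e-3):
    """Index of the first iteration whose relative change is below eps, else None."""
    for a in range(1, len(h)):
        if relative_change(h[a], h[a - 1]) < eps:
            return a
    return None


# h^(a) for the 10 km Tikhonov case (Table 4), iterations 0 to 7.
h_10km = [4.2226e+01, 2.1061e+01, 4.4081e-01, 1.2543e-01,
          2.1706e-02, 4.0675e-03, 7.7232e-04, 1.8579e-04]

# h^(a) for the 5 km Tikhonov case (Table 3), iterations 0 to 14.
h_5km = [1.5128e+02, 7.5916e+01, 1.4695e+00, 7.7154e-01, 5.8276e-02,
         2.7655e-02, 2.3661e-02, 1.0118e-02, 8.1872e-03, 1.9293e-02,
         2.5358e-02, 9.4799e-03, 3.0102e-02, 1.5682e-02, 3.7900e-02]

stop_10km = first_converged_iteration(h_10km)
stop_5km = first_converged_iteration(h_5km)
```

Under the assumed threshold, the 10 km sequence meets the test at iteration seven, while the 5 km sequence never does within fourteen iterations, in agreement with the behavior described in the text.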

Figure 6: Estimated temperature profile for Tikhonov regularization with 5 km tropopause initial guess. (Plot
title: "Temperature profile: 14 iterations"; horizontal axis: Degrees (K).)

Figure 7: Estimated temperature profile for Tikhonov regularization with 5 km tropopause initial guess: best
approximation. (Plot title: "Temperature profile: 4 iterations"; horizontal axis: Degrees (K).)

The  second  case  of  the  retrievals  corresponds  to  a  tropopause  initial  guess  at  10  km.  The  corresponding 
output  data  is  shown  in  Table  4.  The  stopping  criterion  was  satisfied  after  seven  iterations.  The  estimated  profile  is 
shown  in  Figure  8.  It  is  clear  from  that  figure  that  an  initial  guess  with  a  10  km  tropopause  yields  a  better  solution 
than  the  5  km  case. 

A tool that aids in the analysis of the solution to the problem of interest is the discrete Picard condition. To
help visualize how the satisfaction of the Picard condition relates to the solution of the problem, the plot of Figure 9
is generated using the picard function from Regularization Tools (Hansen, 1996). The Picard plot in the first part
of the figure corresponds to the Jacobian determined as a function of the initial guess of the temperature profile.
Here, we see that the Fourier coefficients $|u_i^T b|$ do not decay as fast as the singular values $\sigma_i$, and hence the
discrete Picard condition is not satisfied. The initial temperature profile is far from the real value. In the second part
of the figure, we show the Picard plot for the Jacobian corresponding to the final temperature profile estimate after
seven iterations. As required by the Picard condition, the Fourier coefficients, on the average, decay faster than the
corresponding singular values, and clearly from Figure 9, we can say that the estimated profile approximates the real
profile. The accuracy of the estimated profiles is evaluated in Section 7.4. Because of limitations in space, we will
not include Picard plots for every experimental case. However, the capability to generate these plots is incorporated
into the algorithm, and we recommend their use as an analysis tool.
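Besides visual inspection, the "decay on the average" comparison can be approximated numerically. The Python sketch below is a rough heuristic and not Hansen's picard function: it fits a straight line to the base-10 logarithm of each sequence and compares the slopes. The function names and the synthetic sequences are hypothetical, and all values are assumed to be strictly positive.

```python
import math


def log_slope(values):
    """Least-squares slope of log10(values) against the index 0, 1, 2, ..."""
    n = len(values)
    xs = range(n)
    ys = [math.log10(v) for v in values]
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    num = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    den = sum((x - x_mean) ** 2 for x in xs)
    return num / den


def decays_faster_on_average(fourier_coeffs, singular_values):
    """True if |u_i^T b| has a steeper average log-decay than sigma_i."""
    coeffs = [abs(c) for c in fourier_coeffs]
    return log_slope(coeffs) < log_slope(singular_values)


# Synthetic example: singular values decay one decade per index.
sigma_demo = [10.0 ** (-i) for i in range(6)]
fast_coeffs = [10.0 ** (-2 * i) for i in range(6)]    # two decades per index
slow_coeffs = [10.0 ** (-0.5 * i) for i in range(6)]  # half a decade per index
```

A single fitted slope ignores local behavior, such as coefficients leveling off at a noise floor, so a check of this kind complements the Picard plot rather than replacing it.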

Table 4: Output parameters for retrieval using Tikhonov regularization with 10 km tropopause initial guess.

Columns: iteration a; $\lambda_{TIKH}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda       h            residual     gradient     ||T-c||      rel. h       rel. T
 0   1.0000e-02   4.2226e+01   2.1118e+01   2.8080e+01   1.3463e+00   -            -
 1   1.0000e-02   2.1061e+01   5.0422e-01   1.5695e-01   4.4359e+01   9.5941e-01   3.5229e-02
 2   3.8815e-05   4.4081e-01   1.5180e-01   3.0271e-02   4.9344e+01   1.4311e+01   1.6002e-02
 3   3.8815e-05   1.2543e-01   2.6442e-02   4.7559e-03   4.7391e+01   2.8023e-01   8.0636e-03
 4   3.8815e-05   2.1706e-02   4.8564e-03   9.8002e-04   4.7026e+01   1.0152e-01   3.3632e-03
 5   3.8815e-05   4.0675e-03   8.0563e-04   1.7120e-04   4.6980e+01   1.7567e-02   1.2373e-03
 6   3.8815e-05   7.7232e-04   9.9696e-05   2.2121e-05   4.6975e+01   3.2926e-03   3.3741e-04
 7   1.1694e-06   1.8579e-04   1.0971e-04   3.8968e-05   4.6975e+01   5.8642e-04   2.0819e-05


Figure 8: Estimated temperature profile for Tikhonov regularization with 10 km tropopause initial guess. (Plot
title: "Temperature profile: 7 iterations".)

Figure 9: Picard plots for Tikhonov regularization with 10 km tropopause initial guess. (First panel title:
"Picard plot: Initial guess".)

The third retrieval corresponds to a tropopause initial guess at 15 km. Table 5 contains the output
parameters for this case, and Figure 10 shows the estimated profile. For this case the stopping criterion was
satisfied after 9 iterations. When supplied with a 20 km tropopause initial guess, the results obtained were those
of Table 6. In this case, the program stopped after 10 iterations. Figure 11 shows the estimated profile.


Figure 10: Estimated temperature profile for Tikhonov regularization with 15 km tropopause initial guess. (Plot
title: "Temperature profile: 8 iterations"; horizontal axis: Degrees (K).)

Figure 11: Estimated temperature profile for Tikhonov regularization with 20 km tropopause initial guess. (Plot
title: "Temperature profile: 10 iterations"; horizontal axis: Degrees (K).)

The last retrieval using Tikhonov regularization was performed using a 25 km tropopause initial guess. The results
for this case are presented in Table 7. The corresponding estimated profile after 10 iterations is shown in Figure 12.

Table 5: Output parameters for retrieval using Tikhonov regularization with 15 km tropopause initial guess.

Columns: iteration a; $\lambda_{TIKH}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda       h            residual     gradient     ||T-c||      rel. h       rel. T
 0   1.0000e-02   4.4338e+01   2.2172e+01   3.2706e+01   1.3463e+00   -            -
 1   1.0000e-02   2.2103e+01   2.4266e-01   5.4085e-02   3.4050e+01   9.6240e-01   2.7562e-02
 2   4.0903e-06   1.5934e-01   2.4385e-01   4.7658e-02   5.8713e+01   1.8928e+01   3.6395e-02
 3   4.0903e-06   1.6728e-01   7.6312e-02   1.5113e-02   4.7665e+01   6.7999e-03   2.1774e-02
 4   4.0903e-06   5.3420e-02   2.6772e-02   4.3906e-03   4.3631e+01   1.0808e-01   1.1835e-02
 5   4.0903e-06   1.7160e-02   9.5885e-03   1.6597e-03   4.3748e+01   3.5648e-02   8.0279e-03
 6   4.0903e-06   4.6287e-03   5.2597e-03   1.1311e-03   4.3419e+01   1.2474e-02   4.3699e-03
 7   4.0903e-06   8.0716e-03   1.3294e-02   2.7206e-03   4.3624e+01   3.4154e-03   7.2661e-03
 8   4.0903e-06   8.5155e-03   4.9101e-03   9.5726e-04   4.2841e+01   4.4008e-04   4.9109e-03

Table  6:  Output  parameters  for  retrieval  using  Tikhonov  regularization  with  20  km  tropopause  initial  guess. 


Figure 12: Estimated temperature profile for Tikhonov regularization with 25 km tropopause initial guess. (Plot
title: "Temperature profile: 8 iterations"; horizontal axis: Degrees (K).)


7.2  Retrieval  Using  TSVD  Regularization 

Algorithm 3 was used to solve the same retrieval problems as with Tikhonov regularization, i.e., using the
same initial temperature profiles. For the 5 km tropopause initial guess, the output parameters of Table 8 were
generated. The stopping criterion in column seven was satisfied after eight iterations. The estimated profile is shown
in Figure 13.

The second case corresponds to a 10 km tropopause initial guess. The corresponding output data and estimated
profile are those in Table 9 and Figure 14, respectively. There were a total of seven iterations before the program
stopped.


Table 8: Output parameters for retrieval using TSVD regularization with 5 km tropopause initial guess.

Columns: iteration a; $\lambda_{TSVD}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda   h            residual     gradient     ||T-c||      rel. h       rel. T
 0            1.5128e+02                1.1128e+02   1.3463e+00   -            -
 1    7       7.5946e+01   2.1081e+00   8.3363e-01   8.4629e+01   9.7909e-01   6.6598e-02
 2    8       1.9049e+00   2.5056e-01   6.5009e-02   9.0127e+01   2.5489e+01   2.8681e-02
 3    9       1.0692e-01   1.9285e-01   4.5707e-02   9.8383e+01   1.6243e+00   2.1091e-02
 4   10       8.0371e-02   1.9888e-01   3.6400e-02   9.9865e+01   2.4575e-02   1.7467e-02
 5   11       1.7357e-01   2.4854e-02   5.1014e-03   1.0182e+02   7.9415e-02   7.1890e-03
 6   12       2.4444e-02   1.5807e-03   7.9438e-05   1.0195e+02   1.4557e-01   1.0511e-03
 7   14       2.9849e-03   1.5117e-03   6.7952e-05   1.0193e+02   2.1395e-02   3.0501e-04
 8   16       2.9505e-03   1.5096e-03   9.6279e-05   1.0192e+02   3.4277e-05   1.0760e-04

Table 9: Output parameters for retrieval using TSVD regularization with 10 km tropopause initial guess.

Columns: as in Table 8.

 a   lambda   h            residual     gradient     ||T-c||      rel. h       rel. T
 0    3       4.2151e+01   2.1118e+01   2.8080e+01   1.3463e+00   -            -
 1    7       2.1070e+01   5.0748e-01   1.5552e-01   4.4161e+01   9.5521e-01   3.5071e-02
 2    8       5.0114e-01   4.3163e-02   1.0188e-03   4.4633e+01   1.3702e+01   4.5050e-03
 3    9       6.6883e-02   5.5095e-02   1.1095e-02   4.6473e+01   4.0703e-01   9.7473e-03
 4   13       4.1516e-02   1.4391e-02   2.8421e-03   4.7006e+01   2.4356e-02   5.6180e-03
 5   14       1.3429e-02   1.0537e-03   2.5637e-04   4.6977e+01   2.7714e-02   1.4779e-03
 6   16       6.1013e-04   5.1023e-04   9.5322e-05   4.6990e+01   1.2811e-02   1.2513e-03
 7   18       4.2216e-04   5.3989e-04   9.9192e-05   4.6990e+01   1.8789e-04   3.1702e-04

Figure 13: Estimated temperature profile for TSVD regularization with 5 km tropopause initial guess. (Plot
title: "Temperature profile: 8 iterations"; horizontal axis: Degrees (K).)

Figure 14: Estimated temperature profile for TSVD regularization with 10 km tropopause initial guess. (Plot
title: "Temperature profile: 7 iterations"; horizontal axis: Degrees (K).)


Table 10: Output parameters for retrieval using TSVD regularization with 15 km tropopause initial guess.

Columns: iteration a; $\lambda_{TSVD}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda   h            residual     gradient     ||T-c||      rel. h       rel. T
 0    3       4.4139e+01   2.2172e+01   3.2706e+01   1.3463e+00   -            -
 1    7       2.2103e+01   2.2531e-01   5.1662e-02   3.4381e+01   9.5384e-01   2.7807e-02
 2    8       2.0381e-01   4.8542e-02   6.2174e-03   3.6884e+01   1.8191e+01   7.7194e-03
 3    9       4.7788e-02   5.3115e-02   9.4329e-03   3.9692e+01   1.4890e-01   1.0630e-02
 4   13       2.2475e-02   4.2016e-02   8.0004e-03   4.1131e+01   2.4756e-02   8.7966e-03
 5   14       3.5571e-02   6.5137e-03   1.3191e-03   4.1411e+01   1.2646e-02   3.6844e-03
 6   16       6.3035e-03   7.4370e-03   2.5381e-03   4.2060e+01   2.9085e-02   5.5310e-03
 7   20       1.2986e-02   7.2804e-03   2.4693e-03   4.2074e+01   6.5967e-03   6.4503e-04
 8   22       1.1783e-02   6.9671e-03   2.3180e-03   4.2124e+01   1.1893e-03   1.1178e-03
 9   22       1.0937e-02   7.3184e-03   2.2747e-03   4.2448e+01   8.3638e-04   3.4538e-03

For the 15 km tropopause initial guess, results are included in Table 10 and Figure 15. For this case, nine
iterations were needed before the stopping criterion was satisfied. The next simulation presented corresponds to a
tropopause initial guess at 20 km. For this case, the stopping rule was satisfied after the eighth iteration. Table 11
contains the output parameters for this case. The estimated profile can be seen in Figure 16.


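Table 11: Output parameters for retrieval using TSVD regularization with 20 km tropopause initial guess.

Columns: iteration a; $\lambda_{TSVD}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda   h            residual     gradient     ||T-c||      rel. h       rel. T
 0    3       8.6698e+01   4.3375e+01   6.3532e+01   1.3463e+00   -            -
 1    7       4.3085e+01   9.7295e-01   2.1238e-01   6.4713e+01   9.8930e-01   5.1368e-02
 2    8       9.4577e-01   5.5869e-02   8.1417e-03   7.1524e+01   2.1657e+01   9.3532e-03
 3    9       5.9868e-02   6.5188e-02   1.1613e-02   7.2517e+01   8.3587e-01   1.2733e-02
 4   13       2.8183e-02   5.1013e-02   9.8628e-03   7.3321e+01   3.0817e-02   1.0274e-02
 5   14       3.1125e-02   1.9630e-02   3.8637e-03   7.3841e+01   2.8538e-03   5.6148e-03
 6   15       1.0046e-02   1.0861e-02   2.4072e-03   7.4350e+01   2.0870e-02   5.5236e-03
 7   16       5.6516e-03   6.7127e-03   2.1867e-03   7.5150e+01   4.3694e-03   5.0050e-03
 8   17       4.9841e-03   1.8566e-03   4.0340e-04   7.5435e+01   6.6426e-04   3.3368e-03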


Figure 15: Estimated temperature profile for TSVD regularization with 15 km tropopause initial guess. (Plot
title: "Temperature profile: 9 iterations"; horizontal axis: Degrees (K).)

Figure 16: Estimated temperature profile for TSVD regularization with 20 km tropopause initial guess. (Plot
title: "Temperature profile: 8 iterations"; horizontal axis: Degrees (K).)


The  last  retrieval  case  corresponds  to  a  25  km  tropopause  initial  guess.  By  looking  at  the  output  data  in  Table  12,  we 
notice  that  the  solution  for  this  case  began  diverging  after  iteration  number  eight.  The  estimated  profile  in 
Figure  17  corresponds  to  14  iterations.  The  best  approximation  possible  corresponds  to  the  eighth  iteration  and  is 
shown  in  Figure  18. 


Table 12: Output parameters for retrieval using TSVD regularization with 25 km tropopause initial guess.

Columns: iteration a; $\lambda_{TSVD}^{(a)}$; $h^{(a)} = \|J^{(a)}\delta + r^{(a)}\|_2$; residual norm of the nonlinear problem; norm of its
gradient; $\|\hat{T}^{(a)} - c\|_2$; $|h^{(a)} - h^{(a-1)}| / (1 + |h^{(a)}|)$; $\|\hat{T}^{(a)} - \hat{T}^{(a-1)}\|_2 / (1 + \|\hat{T}^{(a)}\|_2)$.

 a   lambda   h            residual     gradient     ||T-c||      rel. h       rel. T
 0    3       3.0000e+00   5.9096e+01   8.2997e+01   1.3463e+00   -            -
 1    7       7.0000e+00   2.3071e+00   5.1721e-01   9.4097e+01   9.9159e-01   7.4034e-02
 2    8       8.0000e+00   1.6798e-01   2.2658e-02   1.0034e+02   1.7727e+01   1.8078e-02
 3    9       9.0000e+00   1.6187e-01   3.7741e-02   1.0226e+02   1.8306e+00   1.9883e-02
 4   13       1.3000e+01   6.9324e-02   1.3771e-02   1.0318e+02   1.0819e-02   1.0429e-02
 5   14       1.4000e+01   4.7905e-02   9.6947e-03   1.0401e+02   9.1401e-02   7.6560e-03
 6   15       1.5000e+01   1.8327e-02   4.1013e-03   1.0475e+02   9.7661e-03   7.3739e-03
 7   16       1.6000e+01   1.2303e-02   4.2865e-03   1.0589e+02   2.4109e-02   6.8166e-03
 8   20       2.0000e+01   1.1624e-02   3.7980e-03   1.0599e+02   9.5003e-03   2.1652e-03
 9   22       2.2000e+01   1.1980e-02   3.7617e-03   1.0606e+02   2.3413e-03   2.6360e-03
10   22       2.2000e+01   1.1846e-02   3.6780e-03   1.0615e+02   4.6390e-03   1.1528e-03
11   22       2.2000e+01   1.2086e-02   3.5771e-03   1.0625e+02   2.6560e-03   3.5291e-03
12   22       2.2000e+01   1.2301e-02   3.5823e-03   1.0638e+02   6.0531e-03   2.6600e-03
13   22       2.2000e+01   1.3061e-02   3.6439e-03   1.0692e+02   2.9468e-03   4.3784e-03
14   22       2.2000e+01   1.6494e-02   4.0111e-03   1.0808e+02   2.8906e-03   9.2406e-03
15   22       2.2000e+01   1.6609e-02   3.9007e-03   1.0795e+02   6.2099e-03   3.4094e-03

Figure 17: Estimated temperature profile for TSVD regularization with 25 km tropopause initial guess. [Panel title: "Temperature profile: 15 iterations"; horizontal axis: Degrees (K).]

Figure 18: Estimated temperature profile for TSVD regularization with 25 km tropopause initial guess: best approximation. [Panel title: "Temperature profile: 8 iterations"; horizontal axis: Degrees (K).]

7.3  Retrievals  with  the  Addition  of  Noise 

The purpose of this section is to show how the retrievals are affected when we add a noise vector to the
brightness temperatures $T_b$. We define the measured brightness temperature, $\tilde{T}_b$, as

$\tilde{T}_b = T_b + \epsilon_n$

where $T_b$ is the actual brightness temperature. The noise vector $\epsilon_n$ was white Gaussian noise with variance equal
to the noise equivalent temperature difference (NEΔT) for HIRS and MSU.
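
The noise model described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `add_noise`, the channel temperatures, and the NEΔT values are hypothetical and not taken from the report. Because the text specifies that the variance equals NEΔT, the standard deviation is taken as its square root.

```python
import math
import random

def add_noise(tb, nedt, rng):
    """Return measured brightness temperatures tb[k] + e[k].

    e[k] is zero-mean Gaussian noise whose VARIANCE equals nedt[k],
    so the standard deviation passed to gauss() is sqrt(nedt[k]).
    """
    return [t + rng.gauss(0.0, math.sqrt(v)) for t, v in zip(tb, nedt)]

# Hypothetical values for three channels (not from the report).
tb_true = [230.0, 245.5, 260.2]   # noiseless brightness temperatures (K)
nedt = [0.25, 0.25, 0.30]         # assumed NEdT per channel
rng = random.Random(0)            # fixed seed so the run is repeatable
tb_meas = add_noise(tb_true, nedt, rng)
```

Using a seeded generator keeps a noisy test case reproducible, which helps when comparing regularization methods on the same noise realization.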

We performed two test retrievals for each regularization method, for tropopause initial guesses at 10 km
and 15 km, respectively. The estimated profiles for the first test case are shown in Figures 19 and 20. We can
see from these figures that the results are very inaccurate. The stopping criterion was satisfied after a certain
number of iterations, but convergence was far from being achieved. The small relative change in the residual norm
h may indicate that the algorithm is traversing a region in which the surface is fairly flat (Seber and Wild, 1989).
This becomes apparent if we look at Table 13 for Tikhonov regularization. We can see from the third column that the
residual norm h remains high in comparison with previous retrievals, where its order of magnitude was far lower.
Also, small relative changes in the estimated profile indicate that the algorithm is taking short steps.
Table 13 shows the output parameters for the Tikhonov regularization.
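
The relative-change test discussed above can be illustrated as follows. The form |h_i - h_(i-1)| / (1 + h_i) is an inference: it reproduces the tabulated values of Table 13 to within rounding, but it is not stated explicitly in the text. The function names and the tolerance argument are hypothetical.

```python
def relative_change(prev, curr):
    """Relative change between successive values, with 1 added to the
    denominator so the ratio stays bounded when curr is near zero."""
    return abs(curr - prev) / (1.0 + abs(curr))

def should_stop(prev, curr, tol):
    """Stopping test: relative change below a user-chosen tolerance."""
    return relative_change(prev, curr) < tol

# Third-column values (residual norm h) for iterations 1-4, read from Table 13.
h = [2.1765e+01, 1.5298e+00, 1.6271e+00, 1.4673e+00]
changes = [relative_change(a, b) for a, b in zip(h, h[1:])]
```

A small value of this quantity only says that successive iterates are close. As the noisy retrievals show, it can be satisfied while the residual norm itself is still large.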


Figure 19: Estimated temperature profile for noisy Tikhonov regularization with 10 km tropopause initial guess. [Panel title: "Temperature profile: 10 iterations"; horizontal axis: Degrees (K).]

Figure 20: Estimated temperature profile for noisy TSVD regularization with 10 km tropopause initial guess. [Panel title: "Temperature profile: 7 iterations"; horizontal axis: Degrees (K).]


Profiles  for  the  second  test  case  are  shown  in  Figures  21  and  22.  As  shown  in  the  figures,  results  are  very  similar  to 
those  of  the  previous  case.  An  interesting  fact  is  that  after  the  third  or  fourth  iteration  the  profile  did  not  change 
significantly.  In  fact,  if  we  stopped  the  program  after  the  fifth  iteration,  we  could  not  visually  notice  any  difference 
in the estimates. This could mean that the regularization criterion incorporated into the algorithm is not appropriate
for  the  noisy  problem.  In  the  following  section,  we  discuss  some  important  facts  regarding  the  selection  of  the 
regularization  parameter. 

Table 13: Output parameters for noisy retrieval using Tikhonov regularization with 10 km tropopause initial guess.
[Headings of columns (4) to (6) and (8) are illegible in the source.]

Iter.  Reg. param.  h           (4)         (5)         (6)         Rel. change in h  (8)
0      1.0000e-02   4.3464e+01  2.1744e+01  2.9234e+01  1.3463e+00  -                 -
1      1.0000e-02   2.1765e+01  8.3060e-01  1.4553e-01  5.5176e+01  9.5320e-01        4.3034e-02
2      5.5032e-06   1.5298e+00  8.3060e-01  1.4552e-01  5.5177e+01  7.9986e+00        1.4583e-04
3      5.5032e-06   1.6271e+00  8.3059e-01  1.4552e-01  5.5178e+01  3.7048e-02        1.5493e-04
4      5.5032e-06   1.4673e+00  8.3057e-01  1.4552e-01  5.5179e+01  6.4776e-02        8.3771e-05
5      5.5032e-06   1.4822e+00  8.3058e-01  1.4551e-01  5.5180e+01  5.9881e-03        8.7758e-05
6      5.5032e-06   1.5079e+00  8.3055e-01  1.4551e-01  5.5182e+01  1.0249e-02        9.2822e-05
7      5.5032e-06   1.5447e+00  8.3057e-01  1.4552e-01  5.5184e+01  1.4468e-02        9.8604e-05
8      5.5032e-06   1.5975e+00  8.3054e-01  1.4552e-01  5.5186e+01  2.0321e-02        1.0565e-04
9      5.5032e-06   1.4605e+00  8.3056e-01  1.4552e-01  5.5188e+01  5.5678e-02        5.6948e-05
10     5.5032e-06   1.4583e+00  8.3056e-01  1.4552e-01  5.5189e+01  8.6755e-04        5.9344e-05


Figure 21: Estimated temperature profile for noisy Tikhonov regularization with 20 km tropopause initial guess. [Panel title: "Temperature profile: 20 iterations".]

Figure 22: Estimated temperature profile for noisy TSVD regularization with 20 km tropopause initial guess. [Panel title: "Temperature profile: 14 iterations"; horizontal axis: Degrees (K).]


7.4  Comparisons  of  the  Retrievals 

In  this  section,  we  will  compare  the  accuracy  of  the  estimated  temperature  profiles  for  Tikhonov 
regularization  and  TSVD  regularization.  We  will  also  compare  the  retrievals  with  the  addition  of  noise.  Finally,  we 
will discuss the main factors that influence the results of the retrievals.

To evaluate the accuracy of the estimated profiles, we used the root mean square (RMS) error metric. The
RMS error can be computed as:

$\text{RMS error} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(T_i - \hat{T}_i\right)^2}$

where $N = 29$ is the total number of atmospheric layers, $T_i$ are the elements of the real profile vector, and $\hat{T}_i$ are
the elements of the estimated profile. The RMS errors of the estimated temperature profile for all the retrievals
presented here are included in Table 14.
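
Assuming the usual definition of the RMS error (the square root of the mean squared difference between the real and estimated profiles over the N layers), the metric can be sketched as below. The function name and the three-layer example profiles are hypothetical and serve only to show the arithmetic.

```python
import math

def rms_error(true_profile, estimated_profile):
    """Root mean square difference between two equal-length profiles."""
    if len(true_profile) != len(estimated_profile):
        raise ValueError("profiles must have the same number of layers")
    n = len(true_profile)
    squared = sum((t - e) ** 2 for t, e in zip(true_profile, estimated_profile))
    return math.sqrt(squared / n)

# Hypothetical three-layer example (temperatures in K).
t_true = [220.0, 230.0, 240.0]
t_est = [221.0, 229.0, 242.0]
err = rms_error(t_true, t_est)   # sqrt((1 + 1 + 4) / 3) = sqrt(2)
```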

Table 14: Root Mean Square (RMS) error of estimated temperature profiles.

                                        RMS Error
Tropopause           Tikhonov            Tikhonov   TSVD                TSVD
initial guess (km)   (noiseless)         (noisy)    (noiseless)         (noisy)
 5                   4.0728*, 2.5171**   -          3.5182              -
10                   0.9964              6.9729     1.0193              14.8480
15                   1.5808              7.6453     1.2803              10.0670
20                   1.9759              -          1.5111              -
25                   2.1807              -          3.7580*, 1.6970**   -

* obtained for last iteration, ** best approximation obtained


Based  on  the  RMS  errors  of  Table  14,  the  best  approximations  were  obtained  for  retrievals  with  a  tropopause  initial 
guess  at  10  km.  This  is  also  what  we  concluded  just  by  looking  at  the  figures  of  the  estimated  profiles.  This  is  not  an 
unexpected  outcome,  since  this  initial  guess  was  the  closest  to  the  real  profile.  For  the  real  temperature  profile,  the 
tropopause  is  at  approximately  12  km.  The  information  in  Table  14  also  shows  that  the  RMS  error  increases  as  the 
tropopause  initial  guess  deviates  from  the  real  value.  Therefore,  the  accuracy  of  the  retrievals  is  dependent  on  the 
initial  guess. 

The accuracy of the estimated profiles also depends on the regularization parameter λ. Unfortunately, while
this parameter is as important for the accuracy of the retrievals as the initial guess, its computation is not as easy.
The selection criteria that we used for the computation of λ provided good regularization for most cases. However,
these criteria proved inadequate for several of the retrieval cases. Let us take as an example the first retrieval
case, using Tikhonov regularization with a tropopause initial guess at 5 km, shown in Figure 5. With the λ selection
criteria implemented in our algorithm, the problem did not yield an acceptable solution after 14 iterations. It started
approximating the exact solution, but then started diverging in the fifth iteration. We could think at first that we
selected a bad initial guess, or that our algorithm is just too sensitive to the initial guess. But if we modify our
criteria for the selection of the regularization parameter, and start with λ = 0.1 instead of 0.01, we obtain the
profile shown in Figure 23. The RMS error improved from 4.0728 to 2.3333. The stopping criterion for this
case was satisfied after 8 iterations. Table 15 shows the output data for this case. Similarly, for the retrieval using
TSVD regularization and the 25 km tropopause initial guess, a modification of the regularization parameter returned
the improved estimated profile of Figure 24. Table 16 contains the sequence of values for λ and the returned output
data. The RMS error for this case improved from 3.7580 to 1.6137.

The results shown in Figures 23 and 24 are very important since they show the crucial role of the
regularization parameter in the retrieval problem. Simply allowing λ to increase from one iteration to the
next will prevent the solution from achieving convergence. To show this, let us consider Figure 25. This figure
corresponds to a problem solved with a previously implemented algorithm, which was a result of our
first attempts to solve our retrieval problem. It was implemented using a different criterion for the selection of λ.
The criterion was very simple: at each iteration, choose the λ that optimally balances the solution norm and
the residual norm, using only the L-curve method. While this criterion is very useful, it has a major drawback when
used alone: the value of λ is allowed to increase from one iteration to the next. Figure 26 shows the sequence of
values for the regularization parameter. Here we can see the oscillations in the value assigned at each
iteration.
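
One simple way to keep the regularization parameter from increasing is to clamp each new candidate to the previous value. The sketch below is not the selection rule used in this report; it only illustrates the nonincreasing constraint. The function name and the candidate values (which could come, for example, from an L-curve computation) are hypothetical.

```python
def nonincreasing(candidates):
    """Clamp a sequence of candidate regularization parameters so that
    each value is no larger than the one used at the previous iteration."""
    sequence = []
    for value in candidates:
        if sequence:
            value = min(sequence[-1], value)
        sequence.append(value)
    return sequence

# Hypothetical oscillating candidates, one per iteration.
raw = [1e-2, 3e-2, 5e-6, 1e-3]
used = nonincreasing(raw)   # [1e-2, 1e-2, 5e-6, 5e-6]
```

With this clamp, an oscillating candidate sequence can still lower the parameter but can never raise it again.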


Figure  23:  Temperature  retrieval  for  Tikhonov 
regularization  with  5  km  tropopause  initial  guess 
and  modified  regularization  parameter. 


Figure  24:  Temperature  retrieval  for  TSVD 
regularization  with  25  km  tropopause  initial  guess 
and  modified  regularization  parameter. 


Table 15: Output parameters for retrieval using Tikhonov regularization with 5 km tropopause initial guess
and modified regularization parameter.
[Headings of columns (4) to (6) and (8) are illegible in the source.]

Iter.  Reg. param.  h           (4)         (5)         (6)         Rel. change in h  (8)
0      1.0000e-02   1.5128e+02  7.5648e+01  1.1128e+02  1.3463e+00  -                 -
1      1.0000e-02   7.5916e+01  2.0702e+00  8.1140e-01  8.5400e+01  9.7989e-01        6.7164e-02
2      1.0000e-02   1.9247e+00  2.6029e-01  3.4056e-02  8.4792e+01  2.5298e+01        2.2973e-02
3      8.3729e-05   4.3614e-01  5.2589e-01  1.6853e-01  1.0553e+02  1.0365e+00        4.5213e-02
4      8.3729e-05   4.8317e-01  4.5579e-02  1.1953e-02  1.0081e+02  3.1706e-02        1.8099e-02
5      8.3729e-05   3.7959e-02  8.7107e-03  1.8436e-03  1.0092e+02  4.2893e-01        5.1452e-03
6      8.3729e-05   8.3421e-03  5.1052e-04  7.1808e-05  1.0100e+02  2.9372e-02        1.2623e-03
7      8.3729e-05   4.6665e-04  3.1796e-04  9.0537e-06  1.0100e+02  7.8718e-03        1.9518e-04
8      5.3775e-07   6.3381e-04  3.1415e-04  5.7922e-06  1.0100e+02  1.6705e-04        1.0511e-05


For the retrievals with the addition of noise, it was not possible to achieve convergence with the selected
regularization parameters. Unlike in other cases, the various sequences tried were not successful at regularizing the
problem adequately. Moreover, we cannot be sure, without studying the problem further, that a modified
regularization parameter alone will solve the noisy problem.


Table 16: Output parameters for retrieval using TSVD regularization with 25 km tropopause initial guess
and modified regularization parameter.
[Headings of columns (4) to (6) and (8) are illegible in the source.]

Iter.  Reg. param.  h           (4)         (5)         (6)         Rel. change in h  (8)
0      3            1.1813e+02  5.9096e+01  8.2997e+01  1.3463e+00  -                 -
1      3            5.8871e+01  2.2342e+00  3.4294e-01  8.7470e+01  9.8982e-01        6.9435e-02
2      7            2.0914e+00  3.9102e-01  1.2501e-01  9.9069e+01  1.8367e+01        3.0033e-02
3      8            3.6311e-01  6.3232e-02  6.9684e-03  1.0057e+02  1.2679e+00        7.4709e-03
4      9            7.6942e-02  8.7632e-02  2.1264e-02  1.0231e+02  2.6572e-01        1.6747e-02
5      10           7.4146e-02  1.7880e-02  2.7300e-03  1.0278e+02  2.6034e-03        6.1012e-03
6      12           1.7040e-02  3.2845e-02  6.5873e-03  1.0325e+02  5.6148e-02        6.7660e-03
7      12           2.6887e-02  6.0787e-03  1.1864e-03  1.0357e+02  9.5888e-03        3.1218e-03
8      13           6.5282e-03  6.7191e-03  1.3494e-03  1.0367e+02  2.0227e-02        2.2409e-03
9      14           9.5719e-03  9.6549e-03  1.9501e-03  1.0402e+02  3.0148e-03        3.2054e-03
10     15           1.2225e-02  1.8515e-02  4.9941e-03  1.0472e+02  2.6215e-03        7.5048e-03
11     15           1.3038e-02  6.7517e-03  1.3980e-03  1.0524e+02  8.0244e-04        4.3044e-03


Figure 25: Retrieval using only the L-curve for the selection of the regularization parameter. [Panel title: "Tikhonov Regularization: 10 iterations".]

Figure 26: Regularization parameter selected using only the L-curve. [Axis label: Regularization parameter.]


8.  Conclusions 

In this project, we developed an algorithm for nonlinear retrieval problems in atmospheric remote sensing
based on the Gauss-Newton method for solving nonlinear least-squares problems. Regularization was applied to the
underdetermined linear least-squares problem of computing the search direction, using the Tikhonov and TSVD
regularization methods for linear inverse problems. The method was applied to the nonlinear problem of
atmospheric temperature retrieval from HIRS/2 and MSU radiometry. A set of MATLAB macros implementing the
algorithms was developed.
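
To make the structure of such an iteration concrete, the sketch below applies a Gauss-Newton loop with a Tikhonov-regularized search direction to a toy one-parameter problem (fitting y = exp(a t)). It is not the retrieval code developed in this project: the forward model, the data, the fixed value of `lam`, and the stopping rule are all assumptions chosen so that the example is self-contained.

```python
import math

def gauss_newton_tikhonov(t, y, a0, lam, max_iter=50, tol=1e-12):
    """Fit y ~ exp(a*t) for a scalar a.

    At each iteration the step d minimizes ||J*d + r||^2 + lam^2 * d^2,
    which for a scalar unknown gives d = -(J'r) / (J'J + lam^2).
    """
    a = a0
    for _ in range(max_iter):
        model = [math.exp(a * tk) for tk in t]
        r = [m - yk for m, yk in zip(model, y)]      # residual F(a) - y
        jac = [tk * m for tk, m in zip(t, model)]    # dF/da at each point
        jtr = sum(j * rk for j, rk in zip(jac, r))
        jtj = sum(j * j for j in jac)
        d = -jtr / (jtj + lam ** 2)                  # regularized step
        a += d
        if abs(d) < tol * (1.0 + abs(a)):            # relative-change stop
            break
    return a

# Synthetic, noiseless data generated with a = 0.5.
t = [0.0, 1.0, 2.0, 3.0]
y = [math.exp(0.5 * tk) for tk in t]
a_hat = gauss_newton_tikhonov(t, y, a0=0.0, lam=0.1)
```

Because the regularization acts on the step rather than on the solution itself, it damps each update without shifting the point to which a noiseless problem converges.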

Simulation studies were performed to understand the performance of the proposed method. Simulations and
Jacobian calculations were made using the GLA TOVS fast radiative transfer code from NASA Goddard Space Flight
Center. Three key factors affect the performance of the retrieval: (1) the initial guess, (2) the updating
criterion for the regularization parameter, and (3) the stopping criterion. In our simulation studies, temperature
profiles with 5 km, 10 km, 15 km, 20 km, and 25 km tropopause initial guesses were used. The best performance
was achieved with those profiles close to the actual profile. The implementations of the retrieval algorithm produced
similar results for both Tikhonov and TSVD regularization. The root mean square (RMS) error for the estimated
temperature profiles varied from 0.9964 to 4.0728 for the noiseless case. Most of the noiseless cases studied
produced a solution in fewer than 10 iterations.

We saw that the regularization parameter for both methods should form a nonincreasing sequence. That
condition guarantees that regularization is reduced as the algorithm progresses. A good stopping criterion will
guarantee that the algorithm stops before the profile estimate starts to behave erratically. In our simulations, we
saw that when these conditions are not met the algorithm could stay close to the initial guess or could produce a
useless, highly oscillatory solution. The criteria implemented in the algorithms worked in most of the examples
presented. However, we showed with a couple of examples that we could obtain better estimates by stopping the
algorithm earlier or by changing some parameters in the regularization parameter update.

We ran some simulations with white Gaussian noise added to the simulated brightness temperature.
The results were unsatisfactory. The retrievals for noisy cases produced considerably higher RMS errors, from
6.9729 K to 14.8480 K. The criteria for selecting the regularization parameter in the noiseless case seemed to be
inappropriate for the noisy case. In most of the noisy cases studied, the solution stayed fixed at a value after the first
four or five iterations. This points to the need for better modeling of the sensor noise process and for
evaluating the performance of the stopping and regularization criteria under these circumstances. This is a subject
for future work.

The objectives and goals of the original proposal were quite ambitious for one year. We proposed to work
with simulations of the DMSP SSM/T1 & 2 sensor. Although we were able to obtain radiative transfer code
to simulate the measured brightness temperature, no code was found that would provide us with the Jacobian
(sensitivity functions) needed by the algorithm. We did not have time to perform the experimental validation of the
proposed algorithm. We are in search of a validated data set from TOVS and DMSP for this purpose. Initial talks
with researchers at the GSFC Data Assimilation Office are under way to obtain a validated data set for TOVS.
Mr. Vincent Falcone will provide the data set from AF Phillips Laboratory.

9.  Future  Work 

The proposed algorithms need to be validated using experimental data, and their performance compared to
standard retrieval algorithms already in use in the field.

Future work needs to investigate different criteria for the computation of the regularization parameter λ.
New and improved criteria could be developed that adjust according to other important parameters, such as the
residual norm, the gradient, or relative changes in the estimated profiles, to mention a few. Also, the retrievals
for the noisy cases should be investigated further, in order to determine what modifications to the regularization
update criterion are needed to obtain an accurate retrieval.

In  this  project,  we  looked  at  the  development  of  regularization  algorithms  that  regularize  a  linearization  of 
the  nonlinear  retrieval  problem.  Algorithms  that  look  at  the  regularization  of  the  original  nonlinear  problem  are  of 
interest  and  might  result  in  more  accurate  retrievals. 

Abstract

An experimental study of the spray characteristics of impinging water jets was conducted. A Phase
Doppler particle analyzer (PDPA) and video photography were used to measure droplet size and
velocity distributions and liquid sheet behavior. Two impinging angles, 60 and 90 degrees, were
investigated. The injection speed was varied from 4 to 13 m/s, corresponding to Reynolds numbers from 4,600 to 15,000.
Detailed distributions of droplet diameter and velocity for three injection speeds (7.6, 9.0 and 11.3 m/s)
are reported. The droplet size measurements indicate that the 90 degree impinging angle provides finer
atomization than the 60 degree angle at the same Reynolds number. Atomization also improved
as the Reynolds number increased for both impinging angles. The droplet size measurements
indicate that the Sauter mean diameter (SMD) increases as the azimuthal angle increases. This trend
is the reverse of that predicted by the existing atomization models. The droplet velocity distributions were found
to be either normal or bi-modal, depending on the measuring location. The instantaneous sizes and shapes of the liquid
sheet were measured. The liquid sheet instability wave, which propagated upstream to the injector,
was clearly observed. The spray photographs also indicated that the sprays generated by the 90 degree
injector have wider spray angles and are more three-dimensional than those of the 60 degree
injector.




$f(\theta,\phi) = \left[(1-\cos\theta\cos\phi)^{2}/\sin^{3}\theta\right]$.


Here $d_0$ is the initial jet diameter, $We$ is the Weber number based on the injection speed and diameter, and
$2\theta$ and $\phi$ are the impinging angle and the azimuthal angle from the z axis, respectively. This model
predicts that the Sauter mean diameter (SMD) of droplets decreases with increasing
injection speed and impingement angle. For a given injector configuration and flow rate, the
droplet size decreases as the azimuthal angle increases.

In this experimental study, a Phase Doppler particle analyzer (PDPA) and video photography were
used to measure droplet size and velocity distributions and liquid sheet behavior. Two impinging angles
(60 and 90 degrees) with water injection speeds from 4 to 13 m/s were investigated. Detailed distributions
of droplet diameter and velocity at three injection speeds (7.6, 9.0 and 11.3 m/s) are reported and
compared to the trend predicted by existing atomization models. The size, shape and instability of the liquid
sheets were measured by instantaneous photography.

Experimental  setup 

Two-jet impinging injectors, depicted in Figure 1, are used in this study. These injectors are made
up of two capillary stainless steel tubes with 1 mm inner diameter and 100 mm length. The exit orifices
were either 50 mm or 80 mm apart and the impinging angles (2θ) are either 60 or 90 degrees. The
injector is mounted facing downward on a three-dimensional traversing system in order to obtain spatially
resolved droplet size measurements. A waterproof tank (3 x 3 x 2 ft) located about 2 ft downstream of
the injector was used to collect and drain the test liquid. A 40-gallon water tank pressurized by a
gaseous nitrogen k-bottle is used to steadily supply the test fluid. The nitrogen pressure, i.e., the injector
pressure, was precisely regulated, and the water flow rate was calibrated using a graduated cylinder and
a stopwatch. The flow rate was constantly monitored during the tests.

Two optical instruments were used in this study. A two-component PDPA was used for simultaneous
measurements of droplet diameter and velocity. The current PDPA setup can measure droplet sizes
up to 200 µm and velocities up to 100 m/s. The data reported in this study are based on 1,000
droplet samples. The liquid sheet behavior was investigated with a 30 frame per second S-VHS video
system. The back-lit video was taken using a stroboscope with a pulse duration of about 1 µs.

Results  and  Discussions 

Figure 2 shows the water injection speed as a function of the injection pressure. In the current test
article, the injector length (100 mm) to diameter (1 mm) ratio is 100. The injection pressure shows a
linear relationship with the injection speed, which indicates that the majority of the pressure loss is due
to friction loss within the injector body. The Reynolds number, based on the injector diameter and
injection speed, ranges from 4,600 (10 psid) to 15,000 (80 psid). The flow conditions in this study are
turbulent, and the injection pressure loss is found to be consistent with the estimated friction loss of
fully developed turbulent pipe flow. Although the velocity profile of the liquid at the nozzle orifice was
not measured in this study, it should be the same as that of fully developed turbulent pipe flow at the same
Reynolds number.
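
The quoted Reynolds number range can be checked with a short calculation. The kinematic viscosity used below is an assumption: the text does not give the water temperature, and 0.87e-6 m^2/s (roughly water in the mid-20s °C) is chosen because it reproduces the quoted values. The function and variable names are hypothetical.

```python
def reynolds_number(speed, diameter, kinematic_viscosity):
    """Re = V * d / nu for flow in a tube of inner diameter d."""
    return speed * diameter / kinematic_viscosity

NU_WATER = 0.87e-6   # m^2/s, ASSUMED value, not given in the text
D_INJECTOR = 1.0e-3  # m, inner diameter of the capillary tubes

re_low = reynolds_number(4.0, D_INJECTOR, NU_WATER)    # about 4.6e3
re_high = reynolds_number(13.0, D_INJECTOR, NU_WATER)  # about 1.5e4
```

Both values lie well above the usual transition range for pipe flow, in line with the statement that the flow conditions are turbulent.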

The stroboscope back-lit instantaneous photographs of the spray are shown in Figures 3-7. The field
of view of these photographs is roughly 34 x 44 mm. The front view shows the formation of the spray in
the direction perpendicular to the plane of the impinging jets. At 10 psid, the liquid sheet is still intact,
although the instability wave within this liquid sheet is evident, and the droplets are formed at the edge
of the sheet. The edge of the liquid sheet rolls up due to the surface tension force. The droplets are
confined to the centerline for the 60 degree injector, while the droplets begin to disperse in the y
direction for the 90 degree injector. The instability wave of the liquid sheet was found to propagate back to
the injectors in both cases. The variations of the liquid jet diameter were clearly captured in the photos.
These variations are regular and have fixed spatial frequencies. A higher spatial frequency is
observed for the 90 degree injector.

Some general conclusions can be drawn from the photos in Figures 3-7. A higher injection
speed generates a better atomized spray and moves the liquid sheet break-up location closer to the
impingement point. The ligaments were formed from the liquid sheet at certain dominant
frequencies. When the injection speed increases, this frequency is higher and the ligament size is
smaller. In all test conditions except the 10 psid case, the ligaments and droplets propagate outward
from the impingement point. The 90 degree injector atomizes the liquid better than the 60 degree
injector at the same flow rate. The 90 degree sprays disperse more widely in the y-direction (see side view of
photos) than the 60 degree sprays.

The sprays contain ligaments and non-spherical droplets near the impingement point, as shown in the
figures. The PDPA was used to measure the droplet size and velocity distribution at locations as close
to the sprays as possible. These measurements were made along a rectangular box (60 x 30 mm and
80 x 30 mm for the 90 and 60 degree injectors, respectively) where the validation rate of the PDPA is greater
than 80%. The velocity vectors of the measurements at three flow rates (35, 46 and 66 psid) are shown
in Figures 8 and 9. From the directions of the droplet velocity vectors, it is clear that droplets at all
measuring points originate from the impingement point. The droplet dispersion can be neglected
between the impingement point and the measurement locations. These velocity measurements are
consistent with the flow visualization shown in Figures 3-7. The contours of the "continuous" liquid sheet,
based on the video recordings, are also depicted in these figures. As the flow rate increases, the
liquid sheet becomes wider. Generally speaking, the liquid sheets generated by the 90 degree injector are
wider.

The droplet size measurements along the rectangular box depicted in Figures 8 and 9 were
converted to polar coordinates, with the impingement point as the origin. The Sauter Mean Diameter
(SMD) as a function of the azimuthal angle is presented in Figures 10 and 11. The droplet size
decreases as the injection speed increases for both impinging angles. The 90 degree injector produces
better atomization than the 60 degree injector.
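
The two data-reduction steps mentioned above can be sketched as follows. The SMD is computed from its standard definition (sum of d^3 divided by sum of d^2). The coordinate convention (x across the spray, z downstream from the impingement point, angle measured from the z axis) and all names and sample values are assumptions made for illustration.

```python
import math

def sauter_mean_diameter(diameters):
    """SMD (D32): ratio of the summed cubes to the summed squares."""
    return sum(d ** 3 for d in diameters) / sum(d ** 2 for d in diameters)

def to_polar(x, z):
    """Convert a measurement location relative to the impingement point
    into (radius, angle from the z axis in degrees)."""
    radius = math.hypot(x, z)
    angle = math.degrees(math.atan2(abs(x), z))
    return radius, angle

# Hypothetical droplet sample (micrometers) and measurement location (mm).
smd = sauter_mean_diameter([10.0, 20.0, 30.0])   # 36000 / 1400
radius, angle = to_polar(30.0, 30.0)             # on the 45 degree line
```

Because the SMD weights large droplets heavily, a few large droplets in a sample can raise it well above the arithmetic mean diameter.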

The existing atomization models predict that the droplet size decreases as the azimuthal angle from the
vertical axis increases. However, the SMD measurements conducted in this study show the reverse
trend. Kang et al. recently conducted droplet size measurements on similar sprays using a
holographic method. The locations of their measurements are around the edge of the liquid sheet and
are not limited to spherical droplets. Although their data are scattered as a function of the azimuthal
angle, they do show a trend consistent with the SMD measurements conducted in this study: the SMD
increases as the azimuthal angle increases. They also compared their experimental results with three
different models, and all models predicted the reverse trend of the droplet size as a function of the
azimuthal angle. Experimental results from this study and from Kang et al. suggest that the existing
models for predicting the atomization process of impinging injectors should be re-examined.

Droplet size and velocity distributions at four locations (40, 45, 50 and 65 mm downstream of the
impingement point) along the centerline of the 90 degree injector are shown in Figure 12. All droplet size
distributions can be fitted with log-normal distributions with confidence. However, the trend of the axial
velocity distributions is interesting. Near the liquid sheet (40 mm), the distribution is bi-modal, with the
higher droplet population density in the high-velocity group; farther downstream, the higher droplet population density
switches to the lower-speed group; and finally (65 mm) the distribution becomes normal. To further
examine this trend, the correlation between the droplet size and axial velocity is shown in Figure 13. It
is clear that two groups of droplets were formed from the liquid sheet. The group containing large
droplets moves outward at the higher velocity. As this group of droplets moves downstream,
secondary atomization may occur on the larger droplets. This reduces the droplet population density
of this group compared to that of the other group (traveling at lower velocity). Far downstream, these
two groups of droplets merge into a single group. These droplet size and velocity distribution
measurements indicate that the atomization mechanisms of the impinging jets may be far more
complicated than the mechanisms postulated in the existing models. Two physical mechanisms may
co-exist to generate two different groups of droplets. Further study to explore these physical
mechanisms is recommended.

Conclusions 

Impinging  jets  with  two  impinging  angles  (60  and  90  degrees)  were  investigated  in  this  study.  The  liquid 
sheet  instability  wave  was  clearly  observed  and  can  propagate  upstream  to  the  injector.  The  higher 
injection  speed  generates  better  atomized  spray  and  has  the  liquid  sheet  break-up  location  closer  to  the 
impingement point. When the injection speed increases, the ligament size becomes smaller. The
droplet  size  measurements  by  PDPA  indicate  that  the  droplet  size  decreases  as  the  injection  speed 
increases  for  both  impinging  angles.  The  90  degree  injector  atomizes  better  than  the  60  degree 
injector.  However,  the  measured  droplet  size  increases  as  the  azimuthal  angle  increases.  This 
behavior  is  not  consistent  with  predictions  from  the  existing  models.  Two  distinct  groups  of  droplets, 
traveling  at  different  velocities,  were  found  near  the  liquid  sheet.  It  is  clear  that  existing  models  in  the 
literature  have  to  be  improved  in  order  to  predict  the  atomization  process  of  the  impinging  jets. 

Figure 2 Injection Speed as a Function of Injection Pressure

Figure 3 Photograph of Spray Formed by Two Impinging Jets: 10 psid

Figure 8 Droplet Velocity Vector Plot: 60° Impinging angle

Figure 9 Droplet Velocity Vector Plot: 90° Impinging angle

Figure 10 Sauter Mean Diameter (SMD) as a function of angle: 60°

Figure 12 Distribution of Droplet Size and Velocity

Figure 13 Correlation between Droplet Size and velocity